Gene Sama_1336 details


Gene Information

Locus tag: Sama_1336
Symbol:
ID: 4603588
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella amazonensis SB2B
Kingdom: Bacteria
Replicon accession: NC_008700
Strand:
Start bp: 1627401
End bp: 1629698
Gene Length: 2298 bp
Protein Length: 765 aa
Translation table: 11
GC content: 58%
IMG OID: 639780686
Product: DNA internalization-related competence protein ComEC/Rec2
Protein accession: YP_927213
Protein GI: 119774473
COG category: [R] General function prediction only
COG ID: [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily)
TIGRFAM ID: [TIGR00360] ComEC/Rec2-related protein
            [TIGR00361] DNA internalization-related competence protein ComEC/Rec2
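
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the function name `expected_lengths` is hypothetical and not part of any database tool.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Start bp and End bp copied from the record.
gene_len, protein_len = expected_lengths(1627401, 1629698)
print(gene_len, protein_len)  # 2298 765
```

Both values match the Gene Length and Protein Length fields of the record.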


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.62013
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGAACC CATTTCTGCT GGGCTTTTGT GCCCTTGTCA TCTCATCGCT GCTCTGGCCA 
GTGCTGCCGC CCTGGCCGGC CTGCCTGCTT TTGCTTGTGG CATTGGGGTG TTACCGCAAG
TTGCCTGTGC TCAGTGGAGC CCTGACTGCC GTGGCGTGGG TTGCTGTGTA CACCCATATG
CTGTTCGATT ACAGCCACCT GCAACAAGCC GGTGATTCCT GGGTGCGTGG CGAGATAATA
GCACCTGTGT CTGACAGCGG CGACTGGCAG AGTATAGATA TACGTCTTGT TAAACCAAAA
TTAATCTGGC CTGTTGAGGG CAATATCAGG CTCAACTGGC GAACAACTGA CCAAATTCGT
CCCGGGGAGC AGTGGGAATT TCGTCTTGCA CCCCGTTCCA TCACCTCGCC CCTCAACGAA
GGCGCCTTCA ACGGTCAGCG TTACCTGCTA TCACGCCATG TGGGTATCAA GGCAAGAGTG
CTGGAAGCAA GAAAAGTATC CGAGGCGTCA GGGCTTCGCG GGCTATTACT GGAGCACATT
GGCCATGCCA TATCGGAGAA ACCCCGGCAG GCGCTGCTGT ACCCACTGCT GACGGGGGAA
CAGCAGGGCA TAGATGCGCA GACCTGGCAG CGGCTCAGGC AAACCGGCAC CGGGCACCTG
ATGGCCATCT CCGGGTTGCA TATGTCGGTG CTCGGAGCCT GGCTGTTGCT GCTGTGCCGC
GCGCTGTTGA CCTCGTTTGC GCCGCGCCAG GACAGGCGTA ATTTGGTGAT TGCCATGATA
GTCGCCACTG TGGGGTGTCT GCTCTATGGT CTGCTGGCGG GCATGGGCAT TCCTACCCGC
CGGGCCTTTA TCATGCTGGC ATTGGTGGTG CTGCTGACCC TCAGTCGCCG TTTCGCCTCG
CCCTGGGAGC GTTTGTTGTA CGCCCTTGCC GCTGTGTTGT TTCTCGATCC TTTGTCGCCA
CTGTCTGCCG GATTTTGGCT GTCTTTTGGC GCCATCGTGA TTATGTTACT GCTATTGGAC
CGGCCGCCCG CCCACCTTGA GGGTGTCCCC GGTCGATTAA AGCACTACCT GATGTCGCTG
GTGACGCTGC AGTTGGCACT CAGTATCGGA CTGGGAGTAT TGCAGCTGGT GCTCTTTGGC
GGTGTCAGTG TCCATAGCCT GTGGATTAAT CTGCTGATGG TGCCCTGGTT TTCGCTGGTG
GCCATTCCCC TGGCGTTGGC AGGGCTTGTG TTTTTTGTTC TCTTGCTGCC TTTTGGGATA
TTGGCCGACT GGGCGTTTAC TCCCGCCCTC ATGGCATTGA TACCGCTGGA TGGGCTTCTT
ACGTTCAGTG ACCACTTACC CGGCGCCTGG ATAAGCGTGC CGGCACAGTT GATTGCGCCC
CTGTGTTTTG CCATTGCAGG CGCTGTTTTA TTGTTTTTGC CTTTGGCGCG GGGTGTTAAG
TGGGTGAGCG CCTCCTTACT GTTGCCGCTA CTGATAACTC TGAGTGTGAA AGGCGGGCCC
CAATGGCAGA TGCATTTACT GGATGTAGGG CAGGGGCTGG CCGTCGTGGT TTTCAGCCGG
GATCAAACAC TGGTGTACGA CACCGGATTG GCCTTTGGTG ACACCTTCTC CCATGGTGAG
CGAACCCTGG TGCCATTTTT GCGGGCCAAG GGACGTAATC ACATCGATGT GTTGGTCATC
AGCCATGAGG ATAAAGACCA TGCCGGTGGC GCCGCCGCGC TGGCCAGAGC AATGCCAGTC
CACTTACTCA TCAGCGATAC CCGGGCTGCA AGGGATACAC TGGCGATGGA ACATGCGCCT
TGCCGCCCTC AGGCATTCGC CCTTGGCAAT CTGTGGGTAG AGGTCCTGTC GCCCGCAGAC
TCACCGGCTG GAAGAGTAGA CAATAATGCT TCCTGTGTGG TGACAGTGGG CGATGGTCAT
TCGCGGCTGC TGTTGCCCGG CGACATTGAA GCCGAAGGGG AGACGCGGCT CCTTGGCAGT
GGCGAGGCGT TGAACGCCAA TGTCTTAGTG GCACCCCACC ATGGCAGCCT GACGTCATCG
ACTCCGGCCT TTGTCGCGGG CGTAGCACCG GCCATCACCC TCTTTGCTGC CGGCGCCAAC
AACAGATACG GTTTTCCTAA AGACGCCGTG GTGCAGCGAT ACCTGGCCCA GGGCAGTCAA
ACGTTTACTG CGGCGGATAC CGGCCAGATA AGCCTGTACC TTGATGATGA AATCACAGTG
AAAACCTATC GGGGTTCGCT GGCCCCTTTT TGGTATAACC GGGTCTTTGG AGTTGGTGGC
AGGCCGATTA CAGAGTAG
 
Protein sequence
MKNPFLLGFC ALVISSLLWP VLPPWPACLL LLVALGCYRK LPVLSGALTA VAWVAVYTHM 
LFDYSHLQQA GDSWVRGEII APVSDSGDWQ SIDIRLVKPK LIWPVEGNIR LNWRTTDQIR
PGEQWEFRLA PRSITSPLNE GAFNGQRYLL SRHVGIKARV LEARKVSEAS GLRGLLLEHI
GHAISEKPRQ ALLYPLLTGE QQGIDAQTWQ RLRQTGTGHL MAISGLHMSV LGAWLLLLCR
ALLTSFAPRQ DRRNLVIAMI VATVGCLLYG LLAGMGIPTR RAFIMLALVV LLTLSRRFAS
PWERLLYALA AVLFLDPLSP LSAGFWLSFG AIVIMLLLLD RPPAHLEGVP GRLKHYLMSL
VTLQLALSIG LGVLQLVLFG GVSVHSLWIN LLMVPWFSLV AIPLALAGLV FFVLLLPFGI
LADWAFTPAL MALIPLDGLL TFSDHLPGAW ISVPAQLIAP LCFAIAGAVL LFLPLARGVK
WVSASLLLPL LITLSVKGGP QWQMHLLDVG QGLAVVVFSR DQTLVYDTGL AFGDTFSHGE
RTLVPFLRAK GRNHIDVLVI SHEDKDHAGG AAALARAMPV HLLISDTRAA RDTLAMEHAP
CRPQAFALGN LWVEVLSPAD SPAGRVDNNA SCVVTVGDGH SRLLLPGDIE AEGETRLLGS
GEALNANVLV APHHGSLTSS TPAFVAGVAP AITLFAAGAN NRYGFPKDAV VQRYLAQGSQ
TFTAADTGQI SLYLDDEITV KTYRGSLAPF WYNRVFGVGG RPITE
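
The protein sequence follows from the gene sequence under translation table 11. The sketch below is a minimal illustration, not the database's own pipeline: it builds the codon table from the standard NCBI ordering (table 11 assigns the same amino acids as the standard code), treats the first codon as an initiator rendered as M when it is one of the table 11 start codons, and stops at the first stop codon. The names `translate`, `CODON_TABLE` and `START_CODONS` are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiator codons of translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a CDS given as text, ignoring spaces and line breaks."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # assumption: the first codon is the translation start
        if aa == "*":
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "GTGAAGAACC CATTTCTGCT GGGCTTTTGT GCCCTTGTCA TCTCATCGCT GCTCTGGCCA"
print(translate(first_line))  # MKNPFLLGFCALVISSLLWP
```

The output matches the first 20 residues of the protein sequence, including the GTG start codon being read as M.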