Gene Sama_1899 details

Gene Information

Locus tag: Sama_1899
Symbol:
ID: 4604149
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella amazonensis SB2B
Kingdom: Bacteria
Replicon accession: NC_008700
Strand:
Start bp: 2316415
End bp: 2318484
Gene Length: 2070 bp
Protein Length: 689 aa
Translation table: 11
GC content: 55%
IMG OID: 639781276
Product: transglutaminase family protein
Protein accession: YP_927774
Protein GI: 119775034
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
TIGRFAM ID:
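
The start, end, gene length, and protein length fields above are mutually consistent. A minimal sketch of the arithmetic follows. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the record states neither explicitly, and the variable names are illustrative only.

```python
# Values copied from the Gene Information record above.
start_bp = 2316415
end_bp = 2318484

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 2070 689
```

Both results match the Gene Length and Protein Length fields in the record.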


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.642436
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGGGAC CTTCCACCAA ACACCACGCA GAAATTATCG GCCGCAGTAC ACTCTTTTGG 
TTGCTGCTGA CTCACTTTGC GCTCATAGCT CCCCTCGCCG AAAAATCGAC CCCATGGACC
CTCGCCATCA GCGCCATCTG CCTGGTTTGG CGGGTTGGGA TTTTCTATGG CAGGGTCGCG
CGGCCACCGC GGCTCTTGGT AACAGGGCTC GGCATTGCCT CTGCTATCAC ACTGGCCTTG
GTTGGCAAAC AGATAGGTCT TCTCAACGCC TTGATGAACC TGCTAATCCT CGGCTACTCA
TTGAAAACCA TTGAAATGTT GGGTAAGCGA GATGTGCGCA CTGTCATTCT GGTGGGGTAT
TTCCTGATTG CCATCAACCT CATTGATAAT CAGGGAATAG GTGCCATGGC ACTGGCCTTG
ACGCTGTTCT GGCTTAATAC CCAATCGCTG CTGTCTCTTT ATCGCGATCC TGGGAGCAAA
CGCGATGCCC TGGCCGTAAA ACTGGTATTG CAAAGCCTGC CATTGGCCAT ACTGCTATTT
TTAGTGTTGC CAAGACTGCC ACCACTTTGG ATGGTACCCA GCCTCAAGAG CAGTATCACG
GGCCTTGGCA GCGAAGTCGG ATTTGGTGAT ATCAGTAAAC TGACCCAATC CGATGCGTTG
GCATTCAGAG CCAGATTCGA TGGCCAGGTG CCGGCCAATC CGGACCTCTA TTGGCGCGCG
CTGGTGCTGG AAGACTACGA TGGTGCCCAT TGGCGACAAC ACATTGGGAT AAAACGGCTC
GAACGGGAGG CCTTTCTACT GGGCAGCGGC CGCGCAGCGC CCGCAGATGG ACCGCAGCAG
ACATCAAACC GTCCCCGAAA AGACCGGTTG AATTACGAGG TCATTGCCGA GCCAAGCGGT
CAACGATGGC TGTTCGGTTT GGATGTGGCA CATTCCGATA CCCAGGGTGT GGTTAACCTG
CCTGATTATC GCCTGTTTGC ATTAAGACCG CTGGATGGAC GTTTTCAATA TCGCGCCAGC
GCCTTTGATG CCCGTATGGA CTCAAGGCTC CCAGACAGTG TGAGGCGCCT TAATCTTTCG
CTTCCCGATG GGATTAATCC CCGCACCCGT GAGCTGGTCA GTCAGCTTAA GGCCCAAAGC
GATTCACCAC AGGACTTTGT CCAAAGGCTG ATGACAAGGT TTCGTACAGA AGCCTATTTC
TACACCCTGT CACCGCCGCC GGTGGGCACC GCCCAGATAG ATGATTTCCT GTTTGATAAT
AAAAAAGGCT TTTGTGTGCA TTACGCCACC GCCTTGACGT TTATGGCCAG AGCCGCAGGT
ATTCCGGCAC GCCTTGTCTC GGGTTATCAG GGTGGCGAGC TGAACCCTGG GGCCGGTTAT
GTCAGCGTCT ATCAGTATAT GGCGCACGCC TGGAGTGAAG TTTGGTTTGA AGGCCGGGGT
TGGGTTCGGG TAGACCCCAC CGCCATGATA GCCCCTTCCC GTATCCTCGA TGGCTTTGAT
GCCACTTTCA ACCCGGATGA GAGTTACCTG GCAGAAAACC CTTTCAGTGG GCATAGAGTG
AAGGAGATCC CCTGGCTGAA TAATCTCAGG CTGAAGCTCG CCAGTATTGA TTATTACTGG
AGTGTATGGG TACTGGGTTT TGACAATGAA CGCAGGGAAG GTTTGCTTAA AGGTCTCCTT
GGCGGCCTTA ACCCCACCCG CCTAGTCCTG TTTGTATTGG GGATATGTGG CGCCATTTTG
CTGTTTGTGG CCTGGCAGGC GGGCATTTTG CGATTACCCG GCAGGCAACC CCCCTTGCTT
ATGGCGTTCG CACGGATTGA AAAGTCCCTG ACCTCTATCG GGCTGCCACG GAGAATTGCC
GAAGGCCCCG CGGATTATGC GGCCCGAGTG GGCGCAGCGA TGCCGCCACT GGCGATAGAC
CTTAGCCGCT GGGCCCTGCA GTTCAGCCAC CTGAGATATA GCAGTGACAA ACCAAGCCCG
AGGGCACTGC AACGCTTTGT CAAAGATAGC CGTGTCCTTG CCAGAGGCAT CCGTAAACAA
AGCAAAATTT CAACGACCAC CCCTGACTGA
 
Protein sequence
MMGPSTKHHA EIIGRSTLFW LLLTHFALIA PLAEKSTPWT LAISAICLVW RVGIFYGRVA 
RPPRLLVTGL GIASAITLAL VGKQIGLLNA LMNLLILGYS LKTIEMLGKR DVRTVILVGY
FLIAINLIDN QGIGAMALAL TLFWLNTQSL LSLYRDPGSK RDALAVKLVL QSLPLAILLF
LVLPRLPPLW MVPSLKSSIT GLGSEVGFGD ISKLTQSDAL AFRARFDGQV PANPDLYWRA
LVLEDYDGAH WRQHIGIKRL EREAFLLGSG RAAPADGPQQ TSNRPRKDRL NYEVIAEPSG
QRWLFGLDVA HSDTQGVVNL PDYRLFALRP LDGRFQYRAS AFDARMDSRL PDSVRRLNLS
LPDGINPRTR ELVSQLKAQS DSPQDFVQRL MTRFRTEAYF YTLSPPPVGT AQIDDFLFDN
KKGFCVHYAT ALTFMARAAG IPARLVSGYQ GGELNPGAGY VSVYQYMAHA WSEVWFEGRG
WVRVDPTAMI APSRILDGFD ATFNPDESYL AENPFSGHRV KEIPWLNNLR LKLASIDYYW
SVWVLGFDNE RREGLLKGLL GGLNPTRLVL FVLGICGAIL LFVAWQAGIL RLPGRQPPLL
MAFARIEKSL TSIGLPRRIA EGPADYAARV GAAMPPLAID LSRWALQFSH LRYSSDKPSP
RALQRFVKDS RVLARGIRKQ SKISTTTPD
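
The gene and protein sequences above can be cross-checked by translating the nucleotide blocks. The following is a rough sketch, not the tool that produced this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in its permitted start codons, which this sketch does not handle (the gene here starts with ATG, so that does not matter for this record). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
# Assumption: table 11 uses the same codon-to-amino-acid assignments as the
# standard code; alternative start codons are not handled here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last 30 bases of the gene sequence, copied from the record above.
first_block = "ATGATGGGAC CTTCCACCAA ACACCACGCA"
last_block = "AGCAAAATTT CAACGACCAC CCCTGACTGA"

print(translate(first_block))  # MMGPSTKHHA
print(translate(last_block))   # SKISTTTPD
```

The two outputs match the first and last residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same check against the Protein Length and GC content fields.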