Gene Sama_2431 details

Gene Information

Locus tag: Sama_2431
Symbol:
ID: 4604680
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella amazonensis SB2B
Kingdom: Bacteria
Replicon accession: NC_008700
Strand:
Start bp: 2924723
End bp: 2926177
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 53%
IMG OID: 639781828
Product: thiamine biosynthesis protein ThiI
Protein accession: YP_928305
Protein GI: 119775565
COG category: [H] Coenzyme transport and metabolism
              [P] Inorganic ion transport and metabolism
COG ID: [COG0301] Thiamine biosynthesis ATP pyrophosphatase
        [COG0607] Rhodanese-related sulfurtransferase
TIGRFAM ID: [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI
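
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the record. It assumes the start and end coordinates are 1-based and inclusive, and that the unspliced CDS coordinates include the terminating stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Protein length for an unspliced CDS, assuming the stop codon is counted in the gene length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length // 3 - 1


length = gene_length_bp(2924723, 2926177)
print(length, "bp")
print(expected_protein_length_aa(length), "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.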


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTTTA TCGTTAAGCT GTATCCCGAA ATCATGATTA AGAGCAAACC GGTAAGAATG 
CGTTTTACCA AGATGCTCGA GTCCAATATC CGCAATGTGC TGAAAAAAAT CGATGAGGAT
GCCAAGGTAC AGCGCCAATG GGACAAGATC ATGGTGAAGG TGCCCAAAGA CAAGCCTGAG
CTTACCGAGC TCTTTGCCGA GCGTCTGGCT CATATTCCAG GTATTCATCA TGTGCTGCAG
GTGGCCGAGT ACGACTTTGA AACCGTGGAT GATATCTATC AGCTCGCGCT GCCTGTGTAT
CGCGACATGC TCAAGGACAA AACCTTTTGT GTGCGGGTAA AGCGTGCCGG TCAACACGAT
TTCAACTCCA TCGAGGTTGA GCGTTATGTT GGCGGCGGTT TAAACCAGTT TACCGAAGCC
AAGGGCGTGC AGCTGAAGAA CCCGGATGTG ACCATCCAGC TCGAAATCGA TCGCGACAAG
CTCTATATGG TCAGCCAGCG CATCGAGGGT TTGGGGGGCT TCCCCATTGC CGCTCAGGAA
GACGTGCTGT CTTTGATTTC CGGTGGCTTT GACTCGGGCG TGGCCAGCTT CCAGTTTATT
AAAAAGGGTT CCCGTACCCA TTATTGTTTC TTCAATCTTG GTGGCGCCCA GCACGAAATC
GGCGTGAAAC AAGTGGCCTA CCACCTGTGG AAGACCTACG GCGAGTCGCA CAAAGTGAAG
TTTGTGTCTG TGCCCTTCGA AGAAGTGGTA ACAGAAATCC TCGAGCGTAT CGAAAACGGT
CAGATGGGTG TGGTGCTTAA GCGCATGATG ATGCGCGCCG CCACCCGTGT GGCCGAACGC
ATGGGTATTC AGGCTCTGGT CACTGGTGAG AGCCTGGGTC AGGTGTCGAG CCAGACCCTG
ACCAACCTTA ATGTCATTGA CCGCAGCACC GACCTCTTGA TCCTGCGTCC GCTGATCAGC
ATGGATAAAC CGGACATTAT CCGCGAAGCG CGCCGCATAG GTACCGAAGA TTTCGCGGCC
TCCATGCCCG AGTATTGTGG TGTGATTTCC CAGCGCCCTA CCGTGAAGGC GGTACTCTCC
AAGGTGGAAG CCGAAGAGCA GAAGTTTTCT GAAGACCTGC TCGACCGCGT GCTGGCGAAA
GCAGAAGTGA TTGATATCCG TGATATTGCT GTTGCCACCA GTGAGCGAGT GACTGAAACC
GAGACCGTCT CCAGTGCGGC CGGAAACGAA GTTATCATCG ACATTCGCGC GCCAGAAGAA
GAAGAGTCCA GACCCCTGGA CGTGGACGGC GTTGAAGTGA AGGTAATCCC CTTCTTCAAA
CTGGCGACTG CTTTTGCCGA GCTCGATAAA GACAAAACGT ATCTGCTTTA CTGCGAACGT
GGCGTTATGA GTAAGCTGCA AGCCCTGTAT CTGCAAGAGC AGGGCTACAA TAACGTGAAG
GTATATCGTC CCTGA
 
Protein sequence
MKFIVKLYPE IMIKSKPVRM RFTKMLESNI RNVLKKIDED AKVQRQWDKI MVKVPKDKPE 
LTELFAERLA HIPGIHHVLQ VAEYDFETVD DIYQLALPVY RDMLKDKTFC VRVKRAGQHD
FNSIEVERYV GGGLNQFTEA KGVQLKNPDV TIQLEIDRDK LYMVSQRIEG LGGFPIAAQE
DVLSLISGGF DSGVASFQFI KKGSRTHYCF FNLGGAQHEI GVKQVAYHLW KTYGESHKVK
FVSVPFEEVV TEILERIENG QMGVVLKRMM MRAATRVAER MGIQALVTGE SLGQVSSQTL
TNLNVIDRST DLLILRPLIS MDKPDIIREA RRIGTEDFAA SMPEYCGVIS QRPTVKAVLS
KVEAEEQKFS EDLLDRVLAK AEVIDIRDIA VATSERVTET ETVSSAAGNE VIIDIRAPEE
EESRPLDVDG VEVKVIPFFK LATAFAELDK DKTYLLYCER GVMSKLQALY LQEQGYNNVK
VYRP
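
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the tool that produced this record. It assumes the sequence is pasted as plain text in the 10-base blocks shown above, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two tables differ only in permitted start codons). The names `translate` and `gc_percent` are hypothetical.

```python
# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        b1, b2, b3 = (BASES.index(base) for base in dna[i:i + 3])
        residue = AMINO_ACIDS[16 * b1 + 4 * b2 + b3]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAAGTTTA TCGTTAAGCT GTATCCCGAA"))
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_percent` gives a value that can be compared with the GC content field.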