Gene Spro_1081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_1081 
Symbol 
ID5606831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp1189053 
End bp1190501 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content55% 
IMG OID640936600 
Productthiamine biosynthesis protein ThiI 
Protein accessionYP_001477313 
Protein GI157369324 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00012217 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGTTTA TCATTAAATT GTTCCCGGAA ATCACCATCA AGAGCCAATC TGTGCGCTTG 
CGCTTTATCA AGATCCTCTC GACCAGTATT CGCAACGTCC TGAAGCAGTA TGATGAAACA
CTGGCGGTTG TCCGTCACTG GGATCATATC GAAGTTCGCG CCAAAGATGA AAACCAGCGG
CCGGTGATTG CTGACGCGCT GACGCGCATT CCCGGTATCC ACCACATTCT GGAAGTGGAA
GACCGCGACT ATACCGATAT CCACCATATT TTCGAGCAGA CGCTGGAAGC CTACCGCGAG
CAACTGGAAG GCAAGACCTT CTGCGTGCGC GTTAAGCGCC GCGGCAAGCA GGACTTCAAT
TCGCAGGACG TGGAGCGTTA CGTCGGTGGC GGTCTGAACC AGCATATTGA AAGCGCGCGC
GTTAAGCTGT CGCATCCGCA GGTGACGGTT AACCTGGAAA TCGAAAACGA CAAGCTGATG
CTGGTGAAAG CCCGCCGTGA AGGCATCGGC GGCTACCCGG TAGGCACCCA GGAAGACGTG
CTGTCGCTGA TTTCCGGTGG TTTCGACTCG GGTGTTTCCA GTTATATGCT GATGCGCCGC
GGTTGCCGTG TGCATTATTG CTTCTTCAAT CTGGGTGGCG CGGCGCATGA AATTGGCGTG
CGCCAGGTAG CTCACTATCT GTGGAACCGC TTTGCCAGTT CGCACAAGGT GCGCTTTATC
GCCATCGATT TTGAACCTGT GGTTGGCGAG ATCCTGGAAA AGGTCGAAGA CGGCCAGATG
GGCGTGGTGC TCAAGCGCAT GATGGTGCGT GCCGCTTCGC AGATTGCCGA ACGTTATGGC
GTGCAGGCAT TGGTGACCGG TGAAGCGCTG GGGCAGGTAT CCAGCCAGAC GCTGACTAAC
CTGCGCCTGA TCGACAACGC GTCCGATACG CTGATCCTGC GTCCGCTGAT CTCCCACGAC
AAAGAGCACA TCATCAAAGT GGCGCGTGAG ATCGGCACCG AGGACTTCGC CAAGACCATG
CCGGAGTACT GCGGTGTGAT CTCGAAAAGC CCGACGGTGA AGGCGATTAA AGCCAAGATC
GAAGAAGAAG AAGGGCACTT TGATTTCAGC ATTCTTGATC GCGTGGTTAG CGAAGCCAAG
AACGTGGATA TTCGCACCAT CGCCGAGCAG ACTCAGGAGC AGGTTACCGA AGTCGAAACC
GTGGCGGAGT TCGATGCTGA CCAGGTGATT CTGGATATTC GTTCTAACGA CGAGCAGGAA
GATAAACCGC TGAAGCTGGA TCAGGTTGAG GTGAAACCGC TGCCGTTCTA CAAGCTCAGC
ACCCAGTTTG GCGATTTGGA TCAGAGCAAA ACTTACCTGC TGTACTGTGA GCGCGGCGTG
ATGAGCCGCC TGCAGGCGCT GTACCTGCTG GAGCAGGGGT TCACCAACGT GAAGGTTTAC
CGCCCATAA
 
Protein sequence
MKFIIKLFPE ITIKSQSVRL RFIKILSTSI RNVLKQYDET LAVVRHWDHI EVRAKDENQR 
PVIADALTRI PGIHHILEVE DRDYTDIHHI FEQTLEAYRE QLEGKTFCVR VKRRGKQDFN
SQDVERYVGG GLNQHIESAR VKLSHPQVTV NLEIENDKLM LVKARREGIG GYPVGTQEDV
LSLISGGFDS GVSSYMLMRR GCRVHYCFFN LGGAAHEIGV RQVAHYLWNR FASSHKVRFI
AIDFEPVVGE ILEKVEDGQM GVVLKRMMVR AASQIAERYG VQALVTGEAL GQVSSQTLTN
LRLIDNASDT LILRPLISHD KEHIIKVARE IGTEDFAKTM PEYCGVISKS PTVKAIKAKI
EEEEGHFDFS ILDRVVSEAK NVDIRTIAEQ TQEQVTEVET VAEFDADQVI LDIRSNDEQE
DKPLKLDQVE VKPLPFYKLS TQFGDLDQSK TYLLYCERGV MSRLQALYLL EQGFTNVKVY
RP