Gene Ent638_0207 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_0207 
Symbol 
ID5110708 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp240170 
End bp242062 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content55% 
IMG OID640490369 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001174948 
Protein GI146309874 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0365925 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGCAA AATTGACCCG CCGCGAACAG CGCGCACAAG CACAACACTT CATCGATACG 
CTCGAAGGCA CCGCTTTCCC GAACTCAAAA CGTATTTATA TTTCTGGTTC ACAGGCTGAT
ATCCGCGTAC CCATGCGTGA GATCCAGCTC AGCCCTACGC TTCTCGGCGG CAGCAAAGAA
AATCCGCAGT TTGAGGATAA CGAAGCGGTG CCGGTATATG ACACCTCCGG TCCCTATGGT
GATACCGACG TTACCATCAA CGTTCAGCAA GGGCTGGCAA AACTGCGGCA GCCGTGGATT
GACGCGCGTA ATGACAGCGA AGCGCTCACC GTTCGCAGCT CCGCCTACAC CAAAGAACGC
CTCGCAGATG ATGGTCTTGA TGAACTGCGC TTTACCGGTC TGCTGACGCC AAAGCGGGCG
AAATCAGGCA AATGCGTGAC GCAGCTGCAT TATGCGCGCC AGGGTATTGT GACCCCGGAA
ATGGAATTTA TCGCCATTCG CGAAAATATG GGCCGTGAAC GGATTCACAG CGAAGTGCTT
CGCCACCAGC ATCCGGGCGA AGGTTTTGGC GCGCGTCTGC CGGAGAACAT CACGCCGGAG
TTTGTACGTG ATGAAGTGGC CGCTGGTCGC GCCATCATCC CCGCCAATAT TAATCATCCA
GAATCGGAGC CAATGATTAT TGGCCGTAAT TTCCTGGTAA AGGTCAACGC CAATATCGGC
AACTCTGCCG TTACATCATC CATCGAAGAA GAGGTCGAAA AGCTGGTGTG GTCTACACGC
TGGGGGGCGG ACACGGTCAT GGACCTTTCG ACCGGGCGTT ACATCCACGA AACCCGCGAG
TGGATTTTGC GAAACAGCCC CGTTCCTATC GGAACCGTGC CGATCTATCA GGCGCTGGAG
AAGGTCAACG GGATCGCAGA AGATCTGACC TGGGAAGCAT TCCGCGACAC GTTACTTGAG
CAGGCGGAAC AAGGCGTCGA TTACTTCACC ATTCACGCAG GCGTACTGCT GCGCTACGTG
CCGATGACCG CCAAACGCCT GACCGGAATT GTCTCGCGTG GCGGCTCCAT TATGGCGAAG
TGGTGCCTGT CCCATCATCA GGAAAATTTC CTCTACGAAC ACTTCCGCGA AATTTGTGAA
ATCTGTGCGG CCTACGATGT GTCTCTTTCG CTGGGCGACG GGTTGCGTCC TGGCTCCATT
CGCGATGCCA ACGATGAAGC GCAATTTGCC GAACTGCACA CATTGGGTGA GCTAACTAAA
ATCGCGTGGG AATATGACGT GCAGGTGATG ATCGAAGGTC CCGGCCACGT CCCGATGCAG
ATGATTCGCC GCAACATGAC CGAAGAGCTG GAGCACTGCC ACGAAGCGCC GTTCTACACG
CTGGGACCGC TAACGACCGA TATCGCGCCG GGCTACGACC ACTTCACATC AGGGATTGGT
GCCGCGATGA TCGGCTGGTT TGGCTGCGCG ATGCTCTGTT ACGTCACGCC AAAAGAACAC
CTGGGCTTAC CCAACAAAGA AGATGTAAAA CAGGGATTAA TTACCTATAA AATTGCCGCT
CACGCCGCCG ACCTGGCGAA AGGCCATCCG GGCGCGCAAA TCCGCGATAA CGCCATGTCT
AAAGCGCGTT TCGAATTTCG CTGGGAAGAT CAGTTTAACC TAGCGCTCGA CCCCTTCACC
GCCCGTGCGT ATCACGACGA AACCCTGCCG CAAGAATCTG GCAAAGTGGC GCACTTCTGC
TCGATGTGCG GGCCAAAATT CTGCTCGATG AAAATCAGCC AGGAAGTGCG CGATTACGCC
GCGAAACAGG CTATCGAAGT GGGTATGGCC GATATGTCAC AAAACTTCCG CGCGAAAGGT
GGCGAAATCT ACCTTAAAAA GGAGAAGGCA TAA
 
Protein sequence
MSAKLTRREQ RAQAQHFIDT LEGTAFPNSK RIYISGSQAD IRVPMREIQL SPTLLGGSKE 
NPQFEDNEAV PVYDTSGPYG DTDVTINVQQ GLAKLRQPWI DARNDSEALT VRSSAYTKER
LADDGLDELR FTGLLTPKRA KSGKCVTQLH YARQGIVTPE MEFIAIRENM GRERIHSEVL
RHQHPGEGFG ARLPENITPE FVRDEVAAGR AIIPANINHP ESEPMIIGRN FLVKVNANIG
NSAVTSSIEE EVEKLVWSTR WGADTVMDLS TGRYIHETRE WILRNSPVPI GTVPIYQALE
KVNGIAEDLT WEAFRDTLLE QAEQGVDYFT IHAGVLLRYV PMTAKRLTGI VSRGGSIMAK
WCLSHHQENF LYEHFREICE ICAAYDVSLS LGDGLRPGSI RDANDEAQFA ELHTLGELTK
IAWEYDVQVM IEGPGHVPMQ MIRRNMTEEL EHCHEAPFYT LGPLTTDIAP GYDHFTSGIG
AAMIGWFGCA MLCYVTPKEH LGLPNKEDVK QGLITYKIAA HAADLAKGHP GAQIRDNAMS
KARFEFRWED QFNLALDPFT ARAYHDETLP QESGKVAHFC SMCGPKFCSM KISQEVRDYA
AKQAIEVGMA DMSQNFRAKG GEIYLKKEKA