Gene Information |
Locus tag | Ent638_0207 |
Symbol | |
ID | 5110708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | - |
Start bp | 240170 |
End bp | 242062 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640490369 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_001174948 |
Protein GI | 146309874 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
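The coordinate and length fields in the table above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the Protein Length excludes the terminal stop codon. Neither convention is stated in the record. The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information table.
start_bp = 240170
end_bp = 242062
gene_length_bp = 1893
protein_length_aa = 630


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming a terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)
```

Under these assumptions both checks pass for this record: 242062 - 240170 + 1 = 1893 bp, and 1893 / 3 - 1 = 630 aa.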
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0365925 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGCAA AATTGACCCG CCGCGAACAG CGCGCACAAG CACAACACTT CATCGATACG CTCGAAGGCA CCGCTTTCCC GAACTCAAAA CGTATTTATA TTTCTGGTTC ACAGGCTGAT ATCCGCGTAC CCATGCGTGA GATCCAGCTC AGCCCTACGC TTCTCGGCGG CAGCAAAGAA AATCCGCAGT TTGAGGATAA CGAAGCGGTG CCGGTATATG ACACCTCCGG TCCCTATGGT GATACCGACG TTACCATCAA CGTTCAGCAA GGGCTGGCAA AACTGCGGCA GCCGTGGATT GACGCGCGTA ATGACAGCGA AGCGCTCACC GTTCGCAGCT CCGCCTACAC CAAAGAACGC CTCGCAGATG ATGGTCTTGA TGAACTGCGC TTTACCGGTC TGCTGACGCC AAAGCGGGCG AAATCAGGCA AATGCGTGAC GCAGCTGCAT TATGCGCGCC AGGGTATTGT GACCCCGGAA ATGGAATTTA TCGCCATTCG CGAAAATATG GGCCGTGAAC GGATTCACAG CGAAGTGCTT CGCCACCAGC ATCCGGGCGA AGGTTTTGGC GCGCGTCTGC CGGAGAACAT CACGCCGGAG TTTGTACGTG ATGAAGTGGC CGCTGGTCGC GCCATCATCC CCGCCAATAT TAATCATCCA GAATCGGAGC CAATGATTAT TGGCCGTAAT TTCCTGGTAA AGGTCAACGC CAATATCGGC AACTCTGCCG TTACATCATC CATCGAAGAA GAGGTCGAAA AGCTGGTGTG GTCTACACGC TGGGGGGCGG ACACGGTCAT GGACCTTTCG ACCGGGCGTT ACATCCACGA AACCCGCGAG TGGATTTTGC GAAACAGCCC CGTTCCTATC GGAACCGTGC CGATCTATCA GGCGCTGGAG AAGGTCAACG GGATCGCAGA AGATCTGACC TGGGAAGCAT TCCGCGACAC GTTACTTGAG CAGGCGGAAC AAGGCGTCGA TTACTTCACC ATTCACGCAG GCGTACTGCT GCGCTACGTG CCGATGACCG CCAAACGCCT GACCGGAATT GTCTCGCGTG GCGGCTCCAT TATGGCGAAG TGGTGCCTGT CCCATCATCA GGAAAATTTC CTCTACGAAC ACTTCCGCGA AATTTGTGAA ATCTGTGCGG CCTACGATGT GTCTCTTTCG CTGGGCGACG GGTTGCGTCC TGGCTCCATT CGCGATGCCA ACGATGAAGC GCAATTTGCC GAACTGCACA CATTGGGTGA GCTAACTAAA ATCGCGTGGG AATATGACGT GCAGGTGATG ATCGAAGGTC CCGGCCACGT CCCGATGCAG ATGATTCGCC GCAACATGAC CGAAGAGCTG GAGCACTGCC ACGAAGCGCC GTTCTACACG CTGGGACCGC TAACGACCGA TATCGCGCCG GGCTACGACC ACTTCACATC AGGGATTGGT GCCGCGATGA TCGGCTGGTT TGGCTGCGCG ATGCTCTGTT ACGTCACGCC AAAAGAACAC CTGGGCTTAC CCAACAAAGA AGATGTAAAA CAGGGATTAA TTACCTATAA AATTGCCGCT CACGCCGCCG ACCTGGCGAA AGGCCATCCG GGCGCGCAAA TCCGCGATAA CGCCATGTCT AAAGCGCGTT TCGAATTTCG CTGGGAAGAT CAGTTTAACC TAGCGCTCGA CCCCTTCACC GCCCGTGCGT ATCACGACGA AACCCTGCCG CAAGAATCTG GCAAAGTGGC GCACTTCTGC TCGATGTGCG GGCCAAAATT CTGCTCGATG AAAATCAGCC AGGAAGTGCG CGATTACGCC 
GCGAAACAGG CTATCGAAGT GGGTATGGCC GATATGTCAC AAAACTTCCG CGCGAAAGGT GGCGAAATCT ACCTTAAAAA GGAGAAGGCA TAA
|
Protein sequence | MSAKLTRREQ RAQAQHFIDT LEGTAFPNSK RIYISGSQAD IRVPMREIQL SPTLLGGSKE NPQFEDNEAV PVYDTSGPYG DTDVTINVQQ GLAKLRQPWI DARNDSEALT VRSSAYTKER LADDGLDELR FTGLLTPKRA KSGKCVTQLH YARQGIVTPE MEFIAIRENM GRERIHSEVL RHQHPGEGFG ARLPENITPE FVRDEVAAGR AIIPANINHP ESEPMIIGRN FLVKVNANIG NSAVTSSIEE EVEKLVWSTR WGADTVMDLS TGRYIHETRE WILRNSPVPI GTVPIYQALE KVNGIAEDLT WEAFRDTLLE QAEQGVDYFT IHAGVLLRYV PMTAKRLTGI VSRGGSIMAK WCLSHHQENF LYEHFREICE ICAAYDVSLS LGDGLRPGSI RDANDEAQFA ELHTLGELTK IAWEYDVQVM IEGPGHVPMQ MIRRNMTEEL EHCHEAPFYT LGPLTTDIAP GYDHFTSGIG AAMIGWFGCA MLCYVTPKEH LGLPNKEDVK QGLITYKIAA HAADLAKGHP GAQIRDNAMS KARFEFRWED QFNLALDPFT ARAYHDETLP QESGKVAHFC SMCGPKFCSM KISQEVRDYA AKQAIEVGMA DMSQNFRAKG GEIYLKKEKA
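The sequences above are displayed in space-separated blocks of ten characters. Before any downstream use they usually need to be joined into one continuous string. The sketch below shows one way to do that, using only the first and last blocks of the gene sequence as excerpts rather than the full 1893 bp. The helper names are hypothetical. The stop codon set is the one used by translation table 11, and only ATG is checked as a start codon for simplicity.

```python
# Excerpts copied from the Gene sequence field (not the full sequence).
head_blocks = "ATGTCTGCAA AATTGACCCG"
tail_blocks = "GGAGAAGGCA TAA"

# Stop codons for translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def normalize(blocks: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocks.split()).upper()


def starts_with_atg(seq: str) -> bool:
    """True if the sequence begins with the ATG start codon."""
    return seq.startswith("ATG")


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return seq[-3:] in STOP_CODONS


head = normalize(head_blocks)
tail = normalize(tail_blocks)
print(head, starts_with_atg(head))
print(tail, ends_with_stop(tail))
```

The same `normalize` helper applies to the protein sequence, which uses the same ten-character block layout.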
|
| |