Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3893 |
Symbol | |
ID | 8139267 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4479605 |
End bp | 4480912 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644871510 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_003023668 |
Protein GI | 253702479 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 0.377066 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGACAC AGATAGAATC GGCGCGCGCC GGTGTTACCA CCCCGCAAAT GACGCAGGTA GCTTTCGACG AAGATCTGAC GCCGGAGTAC GTCCGGGAGA AGGTGGCGCA GGGGCAGATA GTGATCCCCT GGAACCATGT CAGGAAACCG AAGGTGGCCG GGATCGGCGC GGGGCTCAGG ACCAAGGTGA ACGCCTCCAT CGGGACCTCC TCCGACATCG TCGACTACGA GGCCGAGGTG AAGAAGGGGC TCGCGGCGCA GGAGTCCGGG GCAGACACCC TTATGGAACT ATCCGTAGGA GGGGACCTGG ACCGGGTGCG GCGCGAGGTG ATCGCGGCGG TGGATCTTCC CGTCGGCAAC GTCCCCCTCT ACCAGGCCTT TTGCGAGGCA GCCAGGAAGT ACGGCGACCC CAATAAACTC GACGACGAGA TGCTCTTCGA CATCATCGAA AGGCAGTGCG CCGACGGCAT GGCCTTCATG GCGGTTCACT GCGGCATCAA CCTCTACACC CTGGAGCGCC TGAGGAAGCA GGGTTACCGC TACGGCGGCC TGGTCTCCAA GGGTGGGGTG AGCATGGCGG CCTGGATGAT CGCCAACAAC CGCGAGAACC CGCTCTACGA GAAGTTCGAC CGGGTGGTGG AGATCCTCAA GCGCTACGAC ACCGTGCTGT CGCTGGGCAA CGGCTTGAGG GCCGGCGCCA TCCACGACTC CAGCGACCGG GCGCAGATCC AGGAGCTTTT GATCAACTGC GAGCTGGCGG AACTCGGGCG CGACATGGGG TGCCAGATGC TGGTGGAGGG GCCGGGACAC GTGCCTTTGG ACGAGATCGA GGGAAACATC AAGCTGCAGA AGCGGATGAG CGGCGGTGCC CCCTACTACA TGCTCGGCCC CATCGCCACC GACGTCGCCC CGGGCTTCGA CCATATCACC GCGGCCATAG GCGCGGCCCA GTCTTCGCGC TACGGCGCCG ACCTCATCTG CTACATCACC CCCGCCGAGC ACCTGGCGCT TCCCAACGAG GAGGACGTCC GCATGGGGGT GAAGGCCGCG AAGATCGCCG CCTACATAGG CGACATGAAC AAGTACCCCG AGAAGGGTAG GGAGCGCGAC AAGGAGATGA GCAAGGCCAG AAGGGACCTC GACTGGCAGC GCCAGTTCGA GCTCGCCCTG TTCCCCGAGG ACGCCGCCGC CATCAGGAAG AGCCGCGTAC CCGAGGACGA GCAGACCTGC ACCATGTGCG GCGACTTCTG CGCCTCCCGC GGCGCCGGCA AGATCTTCGC CGGCGACCTG AGGGGTGACA AGATCTAA
|
Protein sequence | MMTQIESARA GVTTPQMTQV AFDEDLTPEY VREKVAQGQI VIPWNHVRKP KVAGIGAGLR TKVNASIGTS SDIVDYEAEV KKGLAAQESG ADTLMELSVG GDLDRVRREV IAAVDLPVGN VPLYQAFCEA ARKYGDPNKL DDEMLFDIIE RQCADGMAFM AVHCGINLYT LERLRKQGYR YGGLVSKGGV SMAAWMIANN RENPLYEKFD RVVEILKRYD TVLSLGNGLR AGAIHDSSDR AQIQELLINC ELAELGRDMG CQMLVEGPGH VPLDEIEGNI KLQKRMSGGA PYYMLGPIAT DVAPGFDHIT AAIGAAQSSR YGADLICYIT PAEHLALPNE EDVRMGVKAA KIAAYIGDMN KYPEKGRERD KEMSKARRDL DWQRQFELAL FPEDAAAIRK SRVPEDEQTC TMCGDFCASR GAGKIFAGDL RGDKI
|
| |