Gene Information |
Locus tag | GSU3005 |
Symbol | thiC-2 |
ID | 2688215 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 3297117 |
End bp | 3298424 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637127698 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | NP_954047 |
Protein GI | 39998096 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
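
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; both assumptions match the reported values but are not stated in the record. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 3297117
END_BP = 3298424


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS, assuming the final codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1308
print(protein_length(length_bp))  # 435
```

Both results agree with the Gene Length (1308 bp) and Protein Length (435 aa) fields.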
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.933742 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAACCC AGATCGAACA GGCCCGCGAG GGCATCATTA CCCCTCAGAT GGCGGCCGTG GCCGCGGAGG AGCACGTCTC CCCCGAGTAT GTCTGCCGGA TGGTGGCCGA GGGGAAGGTC GTCATCCCCT GGAACCACGT GCGCGCGCCA AAGGCAGTCG GCATCGGCAA GGGGCTGCGG ACCAAGGTGA ATGCCTCCAT CGGCACCTCA TCGGACATCG TTGACTACGA GGCCGAGGTG CGCAAGGCCC GGGCCGCCCA GGAGTCAGGC GCCGACACCC TCATGGAGCT GTCCGTGGGC GGCGACCTGG ACCGGGTCCG GCGCGAGGTC ATCGCGGCCG TGGACCTGCC GGTGGGAAAC GTTCCGCTCT ACCAGGCCTT CTGCGAGGCG GCGAGGAAAT ACGGCGACCC CAACCGGCTC GACCCTGAGA TGCTCTTTGA CCTGATTGAG CGCCAGTGCG CCGACGGCAT GGCCTTCATG GCGGTCCACT GCGGCATCAA TCTCTACACC ATCGAACGGC TCCGTCGCCA GGGGTACCGC TACGGCGGCC TCGTCTCAAA GGGAGGGGTG AGCATGGTGG GCTGGATGAT GGCCAACGGC CGCGAGAATC CCCTCTACGA ACAGTTCGAC CGGGTGGTCG GTATCCTGAA GAAATACGAC ACGGTCCTTT CCCTGGGCAA CGGCCTGCGG GCCGGCGCCA TCCACGATTC ATCGGACCGG GCCCAGATCC AGGAGCTGCT GATCAACTGC GAACTGGCGG AGATGGGGCG CGAGATGGGC TGCCAGATGC TGGTGGAGGG CCCCGGTCAC GTCCCCCTGG ACGAGGTGGA GGGGAACATC CAGCTCCAGA AGCGGATGAG CGGCGGCGCA CCCTACTATA TGCTCGGGCC CATCTCCACC GACGTGGCCC CCGGCTTCGA CCACATCACC GCCGCCATTG GCGCGGCCCA GTCGAGCCGC TTCGGCGCCG ACCTGATCTG CTACATCACC CCGGCCGAGC ACCTGGCCCT CCCCAACGAA GAGGACGTCC GCCAGGGGGT AAAGGCGGCC CGGGTCGCCG CCTACATCGG CGACATGAAC AAGTACCCGG AGAAGGGGCG CGAGCGGGAC CGGGAGATGA GCAAGGCCCG TCGCGACCTG GACTGGCAGA GGCAGTTCGA GCTGGCCCTC TATCCAGAGG ACGCCCGGGC GATCAGGGCC AGCCGTACTC CCGAGGATGA GGCCACCTGC ACCATGTGCG GCGACTTCTG CGCCTCCCGG GGGGCCGGCA GGCTGTTTGC CGGCGATCTC AGGGGGGATA AGGTGTAG
|
Protein sequence | MKTQIEQARE GIITPQMAAV AAEEHVSPEY VCRMVAEGKV VIPWNHVRAP KAVGIGKGLR TKVNASIGTS SDIVDYEAEV RKARAAQESG ADTLMELSVG GDLDRVRREV IAAVDLPVGN VPLYQAFCEA ARKYGDPNRL DPEMLFDLIE RQCADGMAFM AVHCGINLYT IERLRRQGYR YGGLVSKGGV SMVGWMMANG RENPLYEQFD RVVGILKKYD TVLSLGNGLR AGAIHDSSDR AQIQELLINC ELAEMGREMG CQMLVEGPGH VPLDEVEGNI QLQKRMSGGA PYYMLGPIST DVAPGFDHIT AAIGAAQSSR FGADLICYIT PAEHLALPNE EDVRQGVKAA RVAAYIGDMN KYPEKGRERD REMSKARRDL DWQRQFELAL YPEDARAIRA SRTPEDEATC TMCGDFCASR GAGRLFAGDL RGDKV
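
The gene sequence begins with GTG, yet the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG can act as an initiator codon and is then read as methionine. The following is a minimal sketch of that translation, not the database's own pipeline. The helper `translate_cds` and the reduced start-codon set are assumptions made for illustration; the codon-to-amino-acid assignments used here are the standard ones, which table 11 shares.

```python
from itertools import product

# Standard codon assignments, bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common table 11 initiators are listed here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, reading an initiator codon as M
    and stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
first_30 = "GTGAAAACCC AGATCGAACA GGCCCGCGAG"
print(translate_cds(first_30))  # MKTQIEQARE
```

The output matches the first ten residues of the protein sequence listed above.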
|
| |