Gene Information |
Locus tag | GSU0604 |
Symbol | thiC-1 |
ID | 2687308 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 637928 |
End bp | 639238 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637125271 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | NP_951662 |
Protein GI | 39995711 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
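The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 637928
END_BP = 639238


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS.

    Assumes the gene length includes exactly one terminal stop codon,
    which does not contribute an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1311
    print(protein_length(length))  # 436
```

Under these assumptions the results agree with the reported Gene Length (1311 bp) and Protein Length (436 aa).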
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTATGA CGCAACTGGA ATACGCCCGC AACGGAATCA TCACCGACAA GATGAAAGAG GCCGCCCTGG CCGAAGGGGT ATCGCCCGAA TTCATCAAGG CGGGCATTGC CGACGGGACC ATCATCATCT GCCACAACAA CAAGCATCAC AACGGCCGCC CCCTGGCCGT GGGCACAGGG CTACGGACCA AGGTCAACGC AAACATCGGC ACCTCCGCCG ACGACACGGA TATCACCAAG GAGCTTGAGA AGGCCCGGGT GGCGGTTCGC CACGGTGCCG ACGCCATCAT GGATCTCTCC ACCGGCGGAC CGGTGGACGA AATCCGCCGC GCGATCATCG CCGAGACGAA TGCCTGCATC GGCAGCGTCC CCCTCTACCA GGCGGCCCTT GATGCGGTCA GGACCAAGAA AAAGGCCATC GTCGACATGA CCGTGGACGA TATCTTCGAG GGGATCATCA AGCATGCCGA GGACGGAGTC GACTTCATAA CCGTCCACTG CGGCGTGACC CGCGCCACGG TCGAGCGCAT GAAGAACGAG GGGCGCATCA TGGACGTGGT TTCCCGCGGC GGCGCCTTCA CCGTGGAGTG GATGACCTAC AACAACGCCG AAAACCCTCT CTTCGAGCAC TTTGACCGGC TGCTCGACAT CGTCAAGGCA TATGACATGA CCCTGTCGCT GGGGGACGGC TTCCGCCCCG GCTGCCTCGC GGATGCCACT GACCGGGCCC AGATCCACGA GCTGATCCTT CTGGGCGAGC TGACCCAGCG GGCTCAGGAC GTCGGGGTTC AGGTCATGAT CGAAGGCCCC GGGCACGTGC CGCTCAACCA GATCGAAGCG AACATTCTCC TCCAGAAGCG ACTCTGCCAC GGCGCTCCCT TTTATGTCCT CGGCCCGCTG GTAACCGATA TCGCTCCCGG CTACGACCAC ATCACCTGCG CGATCGGCGG CGCCATCGCC GCGGCGGCCG GGGCCGACTT CCTCTGCTAC GTGACACCGA GCGAGCACCT GCGACTGCCG AGCGTGGAAG ACGTCCGCGA AGGGGTCATC GCCTCCCGCA TCGCCGCCCA TGCCGCCGAT ATAGCCAAAG GGGTAAAGGG GGCCATGGCT AAGGACATCG CCATGGCTAA ATGCCGTAAG AAGCTGGACT GGGAAGGTCA ATTCGAACAG GCCCTCGACC CGGAAAAAGC CCGGCGCATG CGCGACGAAT CGGGCGTGGC CGAGCACGGC GCCTGCACCA TGTGTGGCGA GTTCTGTGCT TACAAGGTGA TGGACGACGC CATGGAAAAG CAGCGGACGG CAACCCTCTA G
Protein sequence | MAMTQLEYAR NGIITDKMKE AALAEGVSPE FIKAGIADGT IIICHNNKHH NGRPLAVGTG LRTKVNANIG TSADDTDITK ELEKARVAVR HGADAIMDLS TGGPVDEIRR AIIAETNACI GSVPLYQAAL DAVRTKKKAI VDMTVDDIFE GIIKHAEDGV DFITVHCGVT RATVERMKNE GRIMDVVSRG GAFTVEWMTY NNAENPLFEH FDRLLDIVKA YDMTLSLGDG FRPGCLADAT DRAQIHELIL LGELTQRAQD VGVQVMIEGP GHVPLNQIEA NILLQKRLCH GAPFYVLGPL VTDIAPGYDH ITCAIGGAIA AAAGADFLCY VTPSEHLRLP SVEDVREGVI ASRIAAHAAD IAKGVKGAMA KDIAMAKCRK KLDWEGQFEQ ALDPEKARRM RDESGVAEHG ACTMCGEFCA YKVMDDAMEK QRTATL
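The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to strip the spacing, compute a GC fraction comparable to the GC content field, and check that a coding sequence ends in a stop codon. The helper names are hypothetical, and the stop codons listed (TAA, TAG, TGA) are assumed for translation table 11.

```python
def clean_sequence(raw):
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def ends_with_stop(seq, stops=("TAA", "TAG", "TGA")):
    """True if the last three bases form one of the assumed stop codons."""
    return seq[-3:] in stops


if __name__ == "__main__":
    # First two blocks of the gene sequence, used only as a short demo.
    seq = clean_sequence("ATGGCTATGA CGCAACTGGA")
    print(len(seq))          # 20
    print(gc_fraction(seq))  # 0.5
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` gives a value that can be compared with the reported GC content; the figure for the short demo fragment is not representative of the whole gene.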