Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_3852 |
Symbol | |
ID | 5167007 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | + |
Start bp | 4501462 |
End bp | 4502769 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640551334 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_001232575 |
Protein GI | 148265869 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAAGA CTCAACTTGA CTACGCCCGC CAGGGCACGA TCACCAAAGA AATGAAGGAA GCAGCCCTCG CCGAAGGGGT AAGCCCGGAG TTCATCCGCG ACGGACTGGT TGCCGGCAAC ATCATCATCT GCCATAACAT CAAGCACGCA GGCGGTCGAC CGCTGGCGGT AGGCCGCGGA CTGCGCACCA AGGTCAACGC CAACATCGGC ACCTCGGCCG ACGACCTGGA CATAGCCAAG GAGCTGGAAA AGGCCCGCGT AGCGGTAAAA CACGGCGCAG ACGCCATCAT GGACCTCTCC ACCGGCGGAC CGGTTGATGA GATCCGCCGC GCCATCATTG CCGAAACCAG TGCCTGCATC GGCAGCGTAC CCCTCTATCA GGCGGCCCTC GATGCGGTAC GGACAAAGAA GAAGGCGATC GTCGACATGA CCGTGGACGA CATTTTCGCC GGGATAATCA AGCATGCCGA AGACGGAGTG GATTTCATCA CCGTCCACTG CGGCGTGACC TGCGCAACGG TGGAGCGGAT GAAAAACGAG GGTCGGATCA TGGACGTGGT CTCCCGCGGC GGGGCGTTCA CCATCGAGTG GATGGCCCAC AACAACAAGG AAAACCCGCT CTTCGAGCAC TTCGACCGGC TCCTGGAAAT CACCAAAGAG TATGACATGA CCCTCTCCCT GGGTGACGGC TTCCGCCCCG GCTGCCTCGC CGACGCCACC GACCGGGCGC AGATCCACGA ACTGATCCTT CTGGGCGAGC TGACCCAGCG CGCCCAGGCA TTCGGCGTCC AGGTCATGAT TGAAGGTCCG GGGCACATGC CGCTCAACCA GATCGAGGCC AACATCCTCC TGCAGAAGAG GCTCTGTCAC GGCGCCCCAT TCTATGTGCT CGGCCCGCTG GTCACCGACA TCGCCCCGGG CTACGACCAT ATCACCTGCG CCATCGGCGG CACCATCGCC GCCGCCGCCG GGGCCGACTT CCTCTGCTAT GTCACCCCCA GCGAACACCT GCGCCTCCCG ACCGTGGACG ACGTGAGAGA AGGGGTCATC GCCTCCCGCA TCGCCGCCCA CGCTGCCGAC ATCGTCAAGG GGGTGAAGGG GGCGATGGAC AAGGACATCC AGATGGCCAA GTGCCGGAAA AAGCTCGACT GGGAAGGGCA GTTCGCCCTG GCCCTCGACC CGGAAAAGGC CCGGCGGCTG CGCGCCGAAT CAGGGGTTGC CGACCACGGC GCCTGCACCA TGTGCGGCGA GTTCTGCGCC TACAAGGTGA TGGACGACGC CATGGAAAAG CAGGCGGTCG AATCGTAA
|
Protein sequence | MTKTQLDYAR QGTITKEMKE AALAEGVSPE FIRDGLVAGN IIICHNIKHA GGRPLAVGRG LRTKVNANIG TSADDLDIAK ELEKARVAVK HGADAIMDLS TGGPVDEIRR AIIAETSACI GSVPLYQAAL DAVRTKKKAI VDMTVDDIFA GIIKHAEDGV DFITVHCGVT CATVERMKNE GRIMDVVSRG GAFTIEWMAH NNKENPLFEH FDRLLEITKE YDMTLSLGDG FRPGCLADAT DRAQIHELIL LGELTQRAQA FGVQVMIEGP GHMPLNQIEA NILLQKRLCH GAPFYVLGPL VTDIAPGYDH ITCAIGGTIA AAAGADFLCY VTPSEHLRLP TVDDVREGVI ASRIAAHAAD IVKGVKGAMD KDIQMAKCRK KLDWEGQFAL ALDPEKARRL RAESGVADHG ACTMCGEFCA YKVMDDAMEK QAVES
|
| |