Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU0441 |
Symbol | |
ID | 2686372 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 472705 |
End bp | 473790 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637125107 |
Product | radical SAM domain-containing protein |
Protein accession | NP_951500 |
Protein GI | 39995549 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTATTT CATTGGCGAC AATTAGCGGC CGGGTCCATG CCGGCGAACG GATCAGCGAC GAGGAGGCCC TGTTCCTCTT CGAGAGCCGC GACCCCCTGG CCGTGGGAGA ACTGGCCGCA GCCGTCAACC GCCGCCGCAA CGGCGACCGG GTCTTCTTCA ACGTGAACCG GCACATCAAC CACACGAACA TCTGCGTCAA CCGCTGCTCC TTCTGTGCCT TCTACCGCGC CGCCGACGAG CCGGGGGCAT ACCTCTACGA CCTGGAGGAG ATCCGCAACC GCGCGGCCGA GGCCCACGCC CAGGGTGCCA CCGAGATTCA CATCGTGGGC GGCCTCCACC CCGATCTTCC CTTCGACTTC TATCTCGCCA TGCTCCGGAC CGTGAAGGAG GTCTCCCCGG ACCTCCACGT GAAGGCCTTT ACCGCGGTTG AGATCGAGTA CCTGTCGCGG CTCGCCGGCC TTTCGACGGC CGAAACCCTG ACAGTGCTGA AGGAGGCGGG ACTCGGCTCG CTCCCCGGCG GCGGGGCGGA AATCTTTGCT CCGGCCGTGC GCAACCGGCT CTGTCCCGAG AAGATCTCCG GCGACAAGTG GCTGGCCATC ATGGAGGAGG TCCACCGGGC CGGGCTCAAA TCCAATGCCA CCATGCTCTA CGGCCACATC GAGAGCTACG CGGACCGCGT GGACCACATG CGCCGCCTGC GTGAGCTTCA GGACCGCACC GGCGGCTTCC AGGTCTTCAT CCCCCTGGCC TTCCAGAAGG ATAACAACCC CCTGGGGCAC CTGAAACGCC CCGGGCCCGG TGGGGTCGAC GCCCTGCTCA CCCTGGCCGT GGCCCGCATC TACCTGGACA ATTTCGCCAA TATCAAAGCC TACTGGGTCA TGCTCGGGGT AAAGATCGCC CAGACTTCCC TGGCCTTCGG GGTAAACGAT CTGGACGGCA CGGTGGTGGA GGAGAAGATC GGCCATGATG CCGGCGCCGC TTCCCCCCAG ACCATGGGGC GCGACGAAAT CGTCTCCCTG ATCCGCACGG CCGGCCGGGT GCCGGTAGAG CGGGATACGC TGTACAACGA ACTGCGGGTG TACTGA
|
Protein sequence | MSISLATISG RVHAGERISD EEALFLFESR DPLAVGELAA AVNRRRNGDR VFFNVNRHIN HTNICVNRCS FCAFYRAADE PGAYLYDLEE IRNRAAEAHA QGATEIHIVG GLHPDLPFDF YLAMLRTVKE VSPDLHVKAF TAVEIEYLSR LAGLSTAETL TVLKEAGLGS LPGGGAEIFA PAVRNRLCPE KISGDKWLAI MEEVHRAGLK SNATMLYGHI ESYADRVDHM RRLRELQDRT GGFQVFIPLA FQKDNNPLGH LKRPGPGGVD ALLTLAVARI YLDNFANIKA YWVMLGVKIA QTSLAFGVND LDGTVVEEKI GHDAGAASPQ TMGRDEIVSL IRTAGRVPVE RDTLYNELRV Y
|
| |