Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1828 |
Symbol | |
ID | 8137159 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2127963 |
End bp | 2129300 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644869439 |
Product | Radical SAM domain protein |
Protein accession | YP_003021639 |
Protein GI | 253700450 |
COG category | [R] General function prediction only |
COG ID | [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 0.00146977 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAACTTT CCAGGTATTT GAAGATATAC CCCGACCAGG GGCGCCCAGA TCACCTGCTT CTGTTCTCCA CCTTCCAACT CTCTGTTGCT CGCGTTCCCC GCGCCACCCT GGATGACGCG CTGCAAGGCC CCCCTTCTGC CGAACGTGAC GCGCTGGTAC GACTGGGGAT GCTGACCGCG GACCCTGCAG CGGAGAGGAA GCGGATGCGA TTTTTTTTAG ATCGAGCAAA CGAGCGCTCT CGCCACTTTA AGTCCATGGT GGTGTTGAAC CTCGACTGCA ACCTCGACTG CGGCTACTGC TATGAGGGCG AATTCCGCGG CGGGCACTAC ATGTCTAGCG CCACGGCGGA CCTATTGGTT GAGACGCTGC TGCGGGAGAG GATCTCCAAG GGGTGGGACG TCACCCTCTC ATTTTATGGC GGGGAGCCCT TGCTCTCCCA GGACCTGATC GGAAGGATCT CCGCGCCGCT TTTGCAGGCA GCCCGCGACC ACGGAGTAAA GTACGGCTTC AACCTGGTTA CCAACGGCAC GCTCCTGAAC CGCGACACTG CGTTAGAGCT CATACCGCTC GGGTTGCAGG GAGCGAAGTT CACCCTGGAT GGTCCGCGCG AGATTCACGA CGGCGAACGT CCCTACGCTT CCGGCGCTGG GAGCTTCGGC ACGATCGTGG ACAACCTCTC CGAGATCTGC GACCTCCTCC CGATCCAGAT AGGCGGGAAC TTCCGGCGCG AGAATTACCG CGACTTCCCG CGCCTTCTGG ATCAGCTCGC CTGCCGCGGC ATCACCGGGG AAAAGCTCCA ACTGGTCCAG TTCACACCGG TGACGCCGAA GGCGGGTTGC TCCGAGTATG GCTCGGGATG CGCATCGTCG AGCGAGCCGT GGCTTGTGGA GGCTCTCCTT TACCTGCGGG AGGAGATACT TTCAAAGGGG TACAAGACCG GTAAACCTTC GGTCTCCGCC TGCATCGTCG AATTCCAGGA CAACATCGTG GTCAACTGCG AGGGCCGATA CTTCAAGTGC CCGGCTCTGA TGGGATGGGA AGGATACAGC GTAGGGAGTC TCGCCGAGGG GATTAAGGAC TACCGCCAGT CGCACGGCAT CGGGAACTGG CAGGCCGACG CCTGCCTGGA CTGCTGCTAT CTTCCCCTGT GCTTCGGAGG TTGCCGCTTC CTTACCAAGC TGCACGGAAA GGGCCTCGAC GAAGTGGACT GCAGGCGGGA GTTCCTCGAT GCCGCGCTGG AACAGATGCT GTTGCAGAAC ATGGCCTACC CGCACAACGC CGCGAAGAAG TCCGCGCCCC CTTCCTCCGC CTCCATCACT GCGCCCGCCC CCTACTGA
|
Protein sequence | MELSRYLKIY PDQGRPDHLL LFSTFQLSVA RVPRATLDDA LQGPPSAERD ALVRLGMLTA DPAAERKRMR FFLDRANERS RHFKSMVVLN LDCNLDCGYC YEGEFRGGHY MSSATADLLV ETLLRERISK GWDVTLSFYG GEPLLSQDLI GRISAPLLQA ARDHGVKYGF NLVTNGTLLN RDTALELIPL GLQGAKFTLD GPREIHDGER PYASGAGSFG TIVDNLSEIC DLLPIQIGGN FRRENYRDFP RLLDQLACRG ITGEKLQLVQ FTPVTPKAGC SEYGSGCASS SEPWLVEALL YLREEILSKG YKTGKPSVSA CIVEFQDNIV VNCEGRYFKC PALMGWEGYS VGSLAEGIKD YRQSHGIGNW QADACLDCCY LPLCFGGCRF LTKLHGKGLD EVDCRREFLD AALEQMLLQN MAYPHNAAKK SAPPSSASIT APAPY
|
| |