Gene Information |
Locus tag | GM21_1721 |
Symbol | |
ID | 8137052 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2002184 |
End bp | 2003329 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644869333 |
Product | hypothetical protein |
Protein accession | YP_003021533 |
Protein GI | 253700344 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
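The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that Protein Length excludes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for GM21_1721.
length = gene_length(2002184, 2003329)
print(length, protein_length(length))  # prints: 1146 381
```

Both results match the Gene Length (1146 bp) and Protein Length (381 aa) fields in the record.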
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 0.00204898 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAATCGC CCCTGCACCT CATCATGATT CTGTCGGCCG CCTTGGCGCT TTGCGGCTGC TCCGGACCGG AACCCGGCTC CACAGGCCTT GCCGGCTCCT CCGTGAAAGC CCACGCCAAA AAAGCCGCTT ACGGCAGCTA TCGCTTCGGC ATGACCCAGG GGATCGACAT CGGCGCTCAA CCTCTGACGC TTCCTGAGTT TTCGGTCGCC GAGCTCATGG CGCGCGACCG CGTGCTGGCC GAGCGCCTGC ACCGCGGCGG GATGCGACTG CAGATGCTCC CCTTCTACAA CGGTAAGGAC ATCGGCGATT TCCTCTCCAC CGGTGAACTG GAGGGGGGGA TATTCGCGGA CATGCCGGCC TTGACGGCTG CCGCCGGCGG AGACGTGGTG CTGCTTGCCA TGCTGAAACA GGGGGCCGCC TCCATCGTCG CCAGGGCTCC CATGCTGGTG AAGGATCTGG ATGGGAAGAG GGTCGGGGTC ACCACCGGCA GCGCCGCCCA CTTCACCCTT CTGCGGGCGC TGGGCAACGC AGGGTTGGCC GAGAAGGACG TCGAGCTGGT GCCGATGGAG GTGAGCGAGA TGGCCCGGGC GCTTGCCGAC GGCAGGATCG ACGCCTTCTG CGCCTGGGAG CCGACCCCTT CCATAGCCTT TTCCTCCTAT CCCGATTTCC ACCTGGTCCA CAAGGGGCTC AACTACGGGT TCCTCTGCCT GCGACGCGAC TTCGTGAACA GTCATCCCGG TGAAACCAGG GAAATCCTCG CCGCCGTCGC CAGGGCATGT TTCTGGATGC GGGAGGGGGG ACAGATGCGG CAACTGGCCC AGTGGACGAC GCAAGCGGCG ACGAAGTTTC AAGCTGAGCC CTTTGCCCTG AAGCCTGAGC AGATGATGTC CATCACCCGC CGCGACCTGC TCGACGTCCA ATCCTCCCCG CGCATACCGG AGATGCTCCT GCGCGAGCAG GAAGTGCTTT ATCAGAAGTT CCTTTTCCTC AAGAAGATAG GAAGGATACC GGAGACCGCC TCCTGGGCCA AGGTGCGCGG TTCCATAAAT TTAGCGATGT TGCGGGAGGT CATGGCCGAC TCCGACAGGT ACGCCCTGAG AGGGTTCGAC TACCGCGGCA ATACGGAAAC GGATGGAACA AGATGA
Protein sequence | MKSPLHLIMI LSAALALCGC SGPEPGSTGL AGSSVKAHAK KAAYGSYRFG MTQGIDIGAQ PLTLPEFSVA ELMARDRVLA ERLHRGGMRL QMLPFYNGKD IGDFLSTGEL EGGIFADMPA LTAAAGGDVV LLAMLKQGAA SIVARAPMLV KDLDGKRVGV TTGSAAHFTL LRALGNAGLA EKDVELVPME VSEMARALAD GRIDAFCAWE PTPSIAFSSY PDFHLVHKGL NYGFLCLRRD FVNSHPGETR EILAAVARAC FWMREGGQMR QLAQWTTQAA TKFQAEPFAL KPEQMMSITR RDLLDVQSSP RIPEMLLREQ EVLYQKFLFL KKIGRIPETA SWAKVRGSIN LAMLREVMAD SDRYALRGFD YRGNTETDGT R
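The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before use. The following is a minimal sketch of one way to strip the spacing, compute GC content, and translate the gene sequence. The helper names are hypothetical. The codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model).

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated over T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three display blocks of the gene sequence in this record.
prefix = clean_sequence("ATGAAATCGC CCCTGCACCT CATCATGATT")
print(translate(prefix))  # prints: MKSPLHLIMI
```

Applied to the full Gene sequence field, `translate` can be compared against the Protein sequence field, and `gc_fraction` against the GC content field.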