Gene Information |
Locus tag | GM21_1523 |
Symbol | |
ID | 8136852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1784443 |
End bp | 1785681 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644869135 |
Product | hypothetical protein |
Protein accession | YP_003021337 |
Protein GI | 253700148 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
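The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the terminal stop codon; the function name `cds_lengths` is hypothetical and not part of the record.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
print(cds_lengths(1784443, 1785681))  # (1239, 412)
```

Under these assumptions the result agrees with the Gene Length (1239 bp) and Protein Length (412 aa) fields.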
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 5.03932e-19 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAGCGCCC GGGACCGCTT CAGTAAGCTC GGGGGGATCG CCCTGGAGAC GGTGGGATCG CTGCTTCTGG TCCTCGCGCT CTACTGGCTG TTCATGGTCT TTCTCTACGC ACTTTTCCCC TCAGGCACGC CGCTGAAGGA GATGCTGGCG AATCGCGCGG AAGAGCTTCC GGTGAAGGAC GGCGCGGGGC GGCGGCCGGA GGCGGCGCTC AGATCCCTGG TGCGCGACGT CCGCTTCAGG CGCGGCAACT CAGTCGCCTG GGAGGGAGCC AGGGAAGGGA TGCTTCTTTA CAACAACGAC GCCGTGCAGA CCTTCGACCG CTCCGGCGCC ACCTTGTCCT TCGCCCCCAG CGACCGCCTC ACCGTGGGGA GCAATTCCCT GGTGCTGGTC ACCCGCCTGA ACGAAAAGGT GGAGGGGGAG CCTAGGGCCT ACCGGGTCCA GGTGGAAGGG GAGCTGCAGG GAAGCCTTTC CGACGAGAAG CGACTGCGGC TGGAGATCGC CACGGCGGGG CATCTGGCCC GCGTTGCTTC CGGCGCGGCC AGGTTCAGCG TCACCCCCAA CGGCAACGCG TCGAGCCTCG CCGTCTACGC CGGGGAGGTC TGGGTCCAGG GCAGGGACGG GATCGTCCGC GTGCCGGCCT ATCACGGCAT CACCCTGAGA AAAGGGGTCG CGGCGGGGCC GGCGGTGCCG CTTCCCGAGG CGCCGTCGCT CAAAACGGAG AAGCTCCTCT ACAGATTCCG CCTGCTTCCG CCCAAGGTCC GTTTCTCCTG GAGCGGCAAG TCGGGCGAGT ACCACTTCCA GCTGGCGAGA GACCCCCGCT TCAAGAGCCT GGTGCTGGAT AAGAAGCTCG CCGCGCCGGA ACTCGTCACC GGAACGCTGG AAGCCGGAAG CTACTTCTGG CGGGTGAGCG GGGTCATGGA GGCCAGGGAA GGGTTCTTCA GCCGCACCGG GCGCTGCGAC CTGCTGCAGC TTTTGAAACC GCCCGAACTC AAGGTGGAAT TCCCGCCTCA AAGCGCCGCT GCCGGAAACT TCACCTTGAC CGGCAGCGTC GAGCCCGGCG CGCGGGTCTT CGTGAACGGC GTCGAGGTGT CCGGCGCCGG CGACGGGGCT TTTGCCCACG ATCTCAGGCT GAAAAGCGGC GTCAACCTGA TCAGGGTGGA GGCCGTCGAC CAGGCGGGAA ACGCCAGTTA CGCCTCCCGG GTCGTCTACG GGGCAGGTGC CGGACAGCAA GACAGATAG
|
Protein sequence | MSARDRFSKL GGIALETVGS LLLVLALYWL FMVFLYALFP SGTPLKEMLA NRAEELPVKD GAGRRPEAAL RSLVRDVRFR RGNSVAWEGA REGMLLYNND AVQTFDRSGA TLSFAPSDRL TVGSNSLVLV TRLNEKVEGE PRAYRVQVEG ELQGSLSDEK RLRLEIATAG HLARVASGAA RFSVTPNGNA SSLAVYAGEV WVQGRDGIVR VPAYHGITLR KGVAAGPAVP LPEAPSLKTE KLLYRFRLLP PKVRFSWSGK SGEYHFQLAR DPRFKSLVLD KKLAAPELVT GTLEAGSYFW RVSGVMEARE GFFSRTGRCD LLQLLKPPEL KVEFPPQSAA AGNFTLTGSV EPGARVFVNG VEVSGAGDGA FAHDLRLKSG VNLIRVEAVD QAGNASYASR VVYGAGAGQQ DR
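The protein sequence can be reproduced from the gene sequence with translation table 11, as the sketch below illustrates. It assumes that table 11 shares the standard codon assignments and that an alternative start codon such as GTG is read as methionine at the first position; the names `translate_cds`, `CODON_TABLE` and `START_CODONS` are hypothetical and are not part of the record or of any library.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 is
# assumed here to use the same assignments, differing only in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons assumed for table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a plus-strand CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon still initiates with Met
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAGCGCCC GGGACCGCTT CAGTAAGCTC"))  # MSARDRFSKL
```

The first ten residues match the start of the protein sequence listed above, even though the gene begins with GTG rather than ATG.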