Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2000 |
Symbol | |
ID | 8137334 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2318202 |
End bp | 2319719 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644869613 |
Product | hypothetical protein |
Protein accession | YP_003021810 |
Protein GI | 253700621 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 6.94208e-35 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGAAAAAGT CACTGCTTAT TCTTTTCCTT GCGCTCATGG TCCCGCTGAC CTTGAGCCAA GGCACTTCAG CCTGGGCCGA GAGCTACCAG CAGGAGTATT ATCAGACGGG CGCGGACCTA CTGATGCCGG ACGAACTCGA CGAACTGCTG GCCCCCATCG CCCTCTACCC CGACCCGCTG ATCGCGCAGA TCCTGCCGGC TGCGACCTTC GTGGACCAGA TAGACGACGC TGCGCGCTTC GTCAGGCGTT ACGGCCCCAG CCGGGTCGAC TACCAGAGGT GGGATTTAAG CGTCAGGGCC ATCGCCCACT ATCCGCAGGT CATCTACATG ATGGATCGGA ACCTCGATTG GACGGCGTCG CTCGGGCAGG CCTTCATCGA ACAGCCCCAG GAGGTCATGG ACGCCATACA GAGGCTGCGC GACGACGCCC GGGCGGCCGG CAATCTCTAC AGTACGTCTC AGCAGTTGGT GATCGTCGAG GCGGGGATCA TCAGGATCGT TCCGGCCAGA CCGCAGTACG TCTACCTGCC GGTTTACGAC CCTTACGTGG TTTACTACGA GCCCTACTCC CCCTCCTACC CCTTCATCAC CTTCAGCGTG GGATTCACCA TCGGGGCCTG GCTCAACCGC GACTGCGACT GGCGACGGCA TCGGGTGTAC TATCACGGCT GGCGCGGCAC CGGATGGGTA AGCAGGTCGC GCCCGCACAT CCACGACCGG CGCGGGATCT ACATCAATCA GCGCGCCTCC AGCATCACCG TCAATACCCG GGTAGCCCAG CGAGACACCC GGGTCTACCG TCGGGAACTG CGCAACGAGG GGCTGCGCTG GCGCGGCAGG ACAGGCGGGC GCAGCGAACC CGCCCGCGTG CAGACGCGCC GCGGGACCGC GACGCCGGGC GCGAGGATCG AACAGCCGCG CCCGGGTCGA GTAGAACAGC GGCGCAGGGA GCAGACGCCT CAGCCTGGTC CCGGTCGAGT AGAACCGCGA CGTCAGATAC AAGCTCCGCA GCTGACTCCG GGCCGGGTCA AACAGCAACG CCCCGGCAGG ACCGAGAGCG CACCTTCGGG ACGGACACAG CCACGCCCGG GGATAAGGGA GGGGGCGCCG TCCGGCGAGA ACCAGCAGCA CCGCCCTTCG AGGATAGAGC AGCATAAGGC CCCGGCCGTG CAGCAGTCAC CTGCGGCAAC AGGCCGGCAG CCCGCCGGCA CGGAGCAAAA GGTGCGGCGC CGGCAACTAA GACAGATAAA CCGGCCGCCC GCTACCCCTG CCACCGAGGT CCCGCGCGCG ACCGTCCCGG CGCCGAACAC AGCTACCCCG CCGACGAGGG TTACCCCACC CCCGGCCGCC AAACCTCCGG TCGTCCCGGC GCCGGCATCC CCGGCTGCGC CCAGGGAAAT AGAGACGCCT CGCCCCTCAA GGCCGGAACG CGAGCAAGGA GAGGGTTCAG GAAGAGGCGG CCGTGGCGGA GGTGAAAGGG CGGAACCAAG AGGAGGCGGT CAGAGAGAAG GAAGGTAG
|
Protein sequence | MKKSLLILFL ALMVPLTLSQ GTSAWAESYQ QEYYQTGADL LMPDELDELL APIALYPDPL IAQILPAATF VDQIDDAARF VRRYGPSRVD YQRWDLSVRA IAHYPQVIYM MDRNLDWTAS LGQAFIEQPQ EVMDAIQRLR DDARAAGNLY STSQQLVIVE AGIIRIVPAR PQYVYLPVYD PYVVYYEPYS PSYPFITFSV GFTIGAWLNR DCDWRRHRVY YHGWRGTGWV SRSRPHIHDR RGIYINQRAS SITVNTRVAQ RDTRVYRREL RNEGLRWRGR TGGRSEPARV QTRRGTATPG ARIEQPRPGR VEQRRREQTP QPGPGRVEPR RQIQAPQLTP GRVKQQRPGR TESAPSGRTQ PRPGIREGAP SGENQQHRPS RIEQHKAPAV QQSPAATGRQ PAGTEQKVRR RQLRQINRPP ATPATEVPRA TVPAPNTATP PTRVTPPPAA KPPVVPAPAS PAAPREIETP RPSRPEREQG EGSGRGGRGG GERAEPRGGG QREGR
|
| |