Gene Information |
Locus tag | GM21_1205 |
Symbol | |
ID | 8136530 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1409638 |
End bp | 1413042 |
Gene Length | 3405 bp |
Protein Length | 1134 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644868819 |
Product | hypothetical protein |
Protein accession | YP_003021024 |
Protein GI | 253699835 |
COG category | [S] Function unknown |
COG ID | [COG3523] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
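The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon; the function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information table above.
START_BP = 1409638
END_BP = 1413042
REPORTED_GENE_LENGTH = 3405     # bp
REPORTED_PROTEIN_LENGTH = 1134  # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, length == REPORTED_GENE_LENGTH)
    residues = protein_length(length)
    print(residues, residues == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions, 1413042 - 1409638 + 1 gives 3405 bp, and 3405 / 3 - 1 gives 1134 aa, matching the table.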
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 161 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAGAAAG AGACCATCCT CGTCTGCTGC AAGTACGTCC TTTTCGCCGC AGCCTTTGTC CCCTTCCTGG TGATCGCGTT CGTGATGTTT CCGCTCCTCA CCTGGTCCTG GCGTATCGGC GCCTTTTTCC CGTTTTTTGT CCTTGCCCTG TGGGGGATTT TGGCGCTGAT CAAAAAAGCG GTGCTCCTGG TGAGGGGGCG GCGGCAGGCG CCTGCAGCGG TCAACCCTCC GCAACCAAGC GAGGAAGGGG AGGCGCGGGA TCCGCTGGCC GACCTGCAGC GGCACTGGGC GGGTGGGCTG GAGACCTTGA GGCGGTCGCA CCTGAGCCAG GAGGGAGACC CGCTCTACGT GCTCCCCTGG TTCCTGGTCC TTGGCGAGAC GGGATCCGGA AAGAGTGCCG CCCTCAAGGA CGCCCGGCTT TGCTCCCCCT TCCCCGAGAC GGAGCACGGT GCGGCGCCGA CCAGCAGTTG CCGCTGGGTG TTCTACGACC AGGGGGTGCT GATCGACACA GCCGGTAGAT ATGCCGTCCC GGTGGCTGCG GAGCGCGACG ATGCTGAGTG GCGCCGGCTT CTGTGGCTGT TGAAGAAACA CCGGCACGTA GATCCTCTTA ACGGCGTGAT CCTGACCCTT CCCGCGGACA AGCTGGCAAG GGGGGCGCGA GAGGAACTGG AGAACTATGC GCGCGCGCTG CGGGCCCGGC TAGACGAAAT GATCCGCCTG CTCGGCATCA ACTTCCCGGT CTACCTCCTG ATCACCAAAT CCGACCTGAT CGAGGGGATG GAGCCGTTTT GCGTGGCACT CCCCGCCTCG ACGCTCGATC AGCCCATGGG GCTGGCGAAG GAGGAACTAT CGACGGATCT GTCACTGTTC ATGGAACGCT TCTCCCTTGG GCTCGACGAG GCGTTGAGAC GCCTGCGTCT GATCCTTTTG CAGCAAAGGG CACCGGGAGC CGGAGCCGCG AGCTTCCTGC TGTTCCCCGA CCGGATCCGG GGCCTGTACG CTCCGCTGAA CGCCTTCGTG CAGGCGGCAT TCGGGGCAAA CCACTACCAG GAGACTCCCC TTTTGCGCGG CATCTACTTT TGCAGCGCCG ATCCCGCGAA AACCCCGCTG GCGCCAGGAG AGCCTTCCTC ATCCGCACAT CCGCTTTTCC TGCATGAGTT CTTCGAGAAG GTTCTCCCCG GCGACCGGGG GCTGTGGGCT CCGGGAGCGC AGGCCCTGAA ACGGCAGAGG GCGGTACTTA ATTGGGCACT CGCCGGCTGG GGCCTGACCG GACTCGCGCT GTGCCTGATG CTGAGCTATT CCTTCGCGAA AAACATAAGG GTGATCCGGG ACGCCTCACA GCTGGTGTCG CGCGCGCAGG AGCTGCAGGG AACGGTCCCG ATCGACCTGC CGGCGATGGG ACGTTTGAGC GGCATGATTT CCGAGGTCGA GCAGCGGAAC CGGGAGTGGT GGATCCCGCG GTTTGGCCTC GATCGCAGCA AAGAGATCGA GTTGGAGCTT AAGGCCCGCT TTTGCAGCGG CTTCCGCGAC CGCTTCCTGG TCTTTTTCGA CCGCAGCCTT GCCGATACCG TTTCCAGTTT TTCCCCCAGT ACCCCGGACC AGCTCTTTGG AACCTGCGTG ATGCACCTCA GCCGACGCTG CAATCTGCTG AAGGCTCGGC TTGAGGGGGA GAACTCGCGG GCGCTGGCCT CACGGCCGCT TCCCGACTAC CCGTTGATAT TGCTGCACCA GCAAGCCGGA GGCCCTGACT TTGGAAACCT CTACCTCGAC TACCTCAACT GGCGCGCCGA TCGCGAAGGG 
ATCAACAAGG AGCTGGAGTG GCTGCAATTG CTGCTGAAAC AGGCCTATGC CGTGAAGGGA AGCGACCTTT CCTGGCTTTT GGAGTTCGTT GACCGCCTGC ACCCCGAGGC GGCGATAACG CTGCAGCAGT TTTGGGCTGG GTCACGGCCG CTGCCGCAAG AGCCGGCGAT TCCCCCGTCG CTGACGGCCA AAGGAAGTGA GCGGATCCGG GCGATGCTCG CGCAGATAGC TGCCGCTCAT CCCGAGCCGG GGCTACTGGC TAGGGAGAAG GGAGAATTCG AGGCCAGGTA CCGCGCCGCC TGTTTCACCG CATGGCAGCG ATTCGCCTGG GACTTTCCGA AAGGGGACCA GCGCCTGGTG GCGCCCAAGG AGTGGCGCTA TGCCGCTGCC AACATGGCGA GTGACAAAGG TCCGTACCTC AGTTTCATGA GGAGGGCTCT GGCGGAGCTT CAGCCTCTGG CCGGCGCCGG CCATCTTCCC GCCTGGGTCA TGCAGATGTA CCGTTTCCAG CTGTTAAGGG CAGCGGGACC TGCTGCCGGC GTCGCATCCT CTGCGGCACC CGCAGCCGAA GGCGTGGCCA ACCGCCTGCA GAAGTTGGCA ACCGGGAAGG GTAGCGGTCC GGACGGGGTT CCCGGCGCCG ATGTGACCCG GGAATATCTC GATGCGTTGG CTCGCGTCGC CCCTGTCGCA AAATCAAGAT CGCTCGCGCT CCAGATGGCG CGGGAGGCGT TCGCCGACTC TCAGGAACAA AACAAGTCGC CGCTTTTCAT TGCCGCCGAT GCCGCTCAGA GGCTAAACGC GCTCCTGGTA CAGGCTGACG GCGACGACAC TTTCCTGCGG CTGGTTTCCG GGCCCATCGC CTTTTACGGG ACCTTCGTCA GGATGGAGAC CGCCTGCGAG CTGCAGGCCC AGTGGGAAGA GAAGGTGCTG AAGGAAGTGC AGGGGGCAAA CGACGCCCAG ACGCTGCAGT ACCTCTTGGG GAAGGACGGC CCTGTCTGGC GCTATGTCGG CCAGTTCGCG GATCCGTTTC TCGGGTGGAG CCCGGGACGC GGTTACTATG CCCGGTCCGC CCTCGGGGGG GCGGTCCCCT TCAGCACCGA GTTCTATGCC TTCCTGGCCA AAGGAGCGAA AACGAAGATC GCTGCCTCCG CCCCCACCAG GGGGAGCTAC CAGGTCACCA TCAAGGGGCT TCCCACCGAC GCCAACGCTG AGGCGCGCGT GAAGCCGCAG GGGACCCGGC TCGAACTGCA ATGCGCGGCA GGTTCGCAGG TGATCTCCAA CATGAATTAC CCCGTGAGCA GGCCCTTCGC CTGGTCGCCC GACAGTTGCG GCGACGTGCT GTTCCACATC GAGATCGGCG ACACGCTCCT CACCAGGCGA TATCCCGGGA CCCGCGGGTT TGTCGCGTTC CTCAGAGACT TCCCAGGCGG CAGGCACACC TTTTACCCCA GCCACTTTCC CGCCGAACGC GATGCTTTGG AGAAGATGGG GGTCCGCTTC ATCAGGGTGA ACTACCAGAT CTTCGGCGCC GGCGATATCC CGGAAGGGGA AGGGGAGACG CTCCCCGGGC GGGTACCCCG GAAGGTAGCC GAGTGCTGGG ATTGA
|
Protein sequence | MKKETILVCC KYVLFAAAFV PFLVIAFVMF PLLTWSWRIG AFFPFFVLAL WGILALIKKA VLLVRGRRQA PAAVNPPQPS EEGEARDPLA DLQRHWAGGL ETLRRSHLSQ EGDPLYVLPW FLVLGETGSG KSAALKDARL CSPFPETEHG AAPTSSCRWV FYDQGVLIDT AGRYAVPVAA ERDDAEWRRL LWLLKKHRHV DPLNGVILTL PADKLARGAR EELENYARAL RARLDEMIRL LGINFPVYLL ITKSDLIEGM EPFCVALPAS TLDQPMGLAK EELSTDLSLF MERFSLGLDE ALRRLRLILL QQRAPGAGAA SFLLFPDRIR GLYAPLNAFV QAAFGANHYQ ETPLLRGIYF CSADPAKTPL APGEPSSSAH PLFLHEFFEK VLPGDRGLWA PGAQALKRQR AVLNWALAGW GLTGLALCLM LSYSFAKNIR VIRDASQLVS RAQELQGTVP IDLPAMGRLS GMISEVEQRN REWWIPRFGL DRSKEIELEL KARFCSGFRD RFLVFFDRSL ADTVSSFSPS TPDQLFGTCV MHLSRRCNLL KARLEGENSR ALASRPLPDY PLILLHQQAG GPDFGNLYLD YLNWRADREG INKELEWLQL LLKQAYAVKG SDLSWLLEFV DRLHPEAAIT LQQFWAGSRP LPQEPAIPPS LTAKGSERIR AMLAQIAAAH PEPGLLAREK GEFEARYRAA CFTAWQRFAW DFPKGDQRLV APKEWRYAAA NMASDKGPYL SFMRRALAEL QPLAGAGHLP AWVMQMYRFQ LLRAAGPAAG VASSAAPAAE GVANRLQKLA TGKGSGPDGV PGADVTREYL DALARVAPVA KSRSLALQMA REAFADSQEQ NKSPLFIAAD AAQRLNALLV QADGDDTFLR LVSGPIAFYG TFVRMETACE LQAQWEEKVL KEVQGANDAQ TLQYLLGKDG PVWRYVGQFA DPFLGWSPGR GYYARSALGG AVPFSTEFYA FLAKGAKTKI AASAPTRGSY QVTIKGLPTD ANAEARVKPQ GTRLELQCAA GSQVISNMNY PVSRPFAWSP DSCGDVLFHI EIGDTLLTRR YPGTRGFVAF LRDFPGGRHT FYPSHFPAER DALEKMGVRF IRVNYQIFGA GDIPEGEGET LPGRVPRKVA ECWD
|
| |
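The gene sequence begins with TTG rather than ATG, while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is one of the accepted alternative initiation codons and is read as methionine when it starts a CDS. The sketch below illustrates this on the first 30 nucleotides of the gene sequence only. The `translate` helper and the `START_CODONS` subset are illustrative assumptions, not the method the database used to produce the record.

```python
from itertools import product

# Amino acids for all 64 codons, in the conventional T, C, A, G ordering.
# Table 11 uses the same codon-to-residue assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Subset of the table 11 initiation codons (assumption: enough for this example).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(cds: str) -> str:
    """Translate a CDS, reading an alternative start codon as methionine."""
    cds = cds.replace(" ", "").upper()
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is always read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence listed above.
    first_30_nt = "TTGAAGAAAG AGACCATCCT CGTCTGCTGC"
    print(translate(first_30_nt))
```

The result can be compared against the first ten residues of the protein sequence (`MKKETILVCC`). Translating the same fragment without the start-codon rule would give a leading L instead of M.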