Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1522 |
Symbol | |
ID | 8136851 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1778492 |
End bp | 1784446 |
Gene Length | 5955 bp |
Protein Length | 1984 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644869134 |
Product | conserved repeat domain protein |
Protein accession | YP_003021336 |
Protein GI | 253700147 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2885] Outer membrane protein and related peptidoglycan-associated (lipo)proteins |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.00000000000121408 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGATCG CTGCAGGCAA GAAGAGGAGG AGGTGGGGGG GGCTGCGCCT GATCGCGCTC GCGTTCCTTC TTGCCTTTCC CGGTACTGCA TTTCCGCTTA CCCCGGCAGG TACCGTCATC AGGCACGATT TCACCGCACG CTTCGCCACC CCCGCTCTCA CGGAGGTCCG CTCCAACGAA ACCCAGGTGA GCGTGAAGGA CCTGTCCGAC CCGGAGCTGG TACCCCCGCG TACCGCCACC ACGAACGCCA CAGTTCCCGT CGATTTCCGT CACACCCTCA CCAACCGGGG GAACTTTGCC GACAGTTTCA GGTTGAAGGC CGCGGCGGTC GACGCGGCCC TGCCGGAGAA CGGACAGGCG CCGCTGTTCC GCTTTTACGC CGCTGACGGC GTGACCCCGT TTCCCACCGA TGCCGGCGGA TTGCAGGTAG TAGGCCCCCT TGAGCCCGGC GCTTCCGTCG AGCTGGTGTT GAGGGCCACG CCGGCTGCAG GGAGCGAGGG GCGGGTGGCG TTGATCGCGG TCAGCGCCAC CTCCGTGCTG GTGCCGGCGC GAAGCGCGAG CCTGACCAAC CAGCTACAGG TGCCGGCCGG AGGTGTTTCC TTCCTCAAGG CCGTCTCCCC TGTGGGGAGC GTACTTCCCG GGACCCTCCT GACCTACGGC ATCACGCTTG GGAACAGCGG GGGAGCGCCC GTGGCCGGTG TGCGGGTGGT CGATCCTTTG GATCCCCTTC TCGAATACCA GCCTGGGAGT GCGGCGCTTC CTGCCGGAGT TCCCGGAAGC GTCAGTTTCG ACGCCGCAAC CCGGACGCTC GTCTTCGACA TACCGTCGCT CCCCGCCGGA TTCAGCGGGG AGATCACCTT CCAGGCCCGC GTACGTGACG ACGCCCCGGG AAATGCCTCC ATCGTCAACA CCGCCGGCGT CTCCACTGCG CAAAATGCTG CACCCTCCCT CAGCAACAGC ACCCTCAACA CAGTGCTGGT CAGATTGCTC CAGGTGACCA AGGCTGCGGG AAGCAGCGCT GCCGACGCCG GCGACATAGT GTCGTACACG GTGCGCGTGC AGAACGTCGG CGCAGGCGTG CTGAACCATG TCACCGTAGG CGACCGTATT CCCCGCGGTT TCCGCTATCT CAAGGGGGCG AGCCATGTGG ACGGGTTCGC CGTGTCGGAC CCGGTGGGAG GGGGGCAGCA GTTGAGTTGG GACCTGGGGA GCCTGGAGCC CGGAGCCTTC AAAGTGCTCA ACTACCGGTT GGTTGTTTCC GCTGAGGCCC CCGTCGGGAC CGCCGTCAAC TGGGCCCGGG CCACCGGCAC CCCTGCCGGC GGCGGCAGCG TCGTCTCCCC TCCCGCCAGC GCCTCCGTCA AGGTGAGGTC CTCCGTTTTG GGGAGCAAGG CCGTCATCAT GGGTAGGGTC TTCCATGACG GCAACGGGAA CGGCGTGCCG GACCAAGGGG AGCGGGGCGA GGCCGGGGTG CGGGTGTACC TGGAAGACGG TTCCTTCGTC TTCACCGACG TGGAAGGGCA GTACAGTTTC ACCGGGGTTT CCGCAGGCGA TCACGTCGTC AAGCTCGACC GGTCCACCCT GGACCCGGGT TTCGTCTCCG TGCCGTACAA CACCGCCTTT GCCGGGGTGG GGTGGTCGCA GTTCATCACC GTCCCCTTCG GCGGCCCGGC ACGCGGCGAC TTCGCGCTCG CCGGGAAAGC GGAAGCGTCG GCACCGCAGC CGGCAGGTCG GACAGGTCAG ACAAGTCCGA CAAGTCCGAC AAGTCCGACA AGTCCGACGC TGTTGGCGCC TCCGGCCGCC GAACCGGATG CCGTGCCCCG GCCCCGGCTC CGGCTGACCC CTGCTCTGGT GGAGCTTCCG GCGGACGCCA AAACCACGGT CCCCTTCAAG GTCGAGCTCC TGGACCCCGC CGACAGGCGG GTCCCCGGCT CCGCCACCGT CACCCTCTTC CTCGCCAAGG GGACCCTGCT GGAACCGGAC CTGGATCCGG CCCTCCCGGG CCACCAGATC CGCGTCACAG ACGGCCTCGG CGTCTTCCAG GTTCGCTCCG GCCGCGCCAC CGGCAACGAC CTCCTCATCG CGAAGGACGG CAACGGCAGC GCCGCCCAGT CAGAGCTCTT CTTCAGCCCA TACCGGCGCG ACTGGATCCT GGTCGGCCTG GGCTCCCTGA CCGTAGGGGG GAAGGCGGTA AGCGGCAACC TGGAAAAGAT CGACAAGGAC GACCGCTTCG ACGAGGGAAT CTTCCACGAG GAGCGGCTCG CCTTTTTCAC CCGCGGCAAG ATCCTCGGCA AGTACCTCCT CACCGCCGCC TACGACTCGG ATAAGGAGCG CCGCGACGGC GTCTTCCAGG CCATCGACCC GGAAAAGTAC TACCCTGTCT ACGGCGACGC CACCGACATA GGGTACGAGG CGGAGTCCCG CGGCAAGATC TATCTCAAGC TGGAAAGCGG CCGCTCCTAT CTCATGGCCG GCGACTACCG CACCGACCTC TCCGAAAACG AATTCTCCCG CTACGACAGG GCGCTAAACG GCGCCAAGTT CGAGGTCAAC ACCAAATCGG TGACCGTCAA AGGGTTCGAA AGCCGCACCG AGGAAACGGT GACGCGCGAC GACATCCCCG GCAACGGCAC CTCCGGCCAC TACTTCCTCT CCAGGCGGGG CGCCTTCGAG AACAGCGAGC GCGTGCGCAT CGAGGTGCGC GACCGCTACC ACACGGAACG CGTCATCGCC GTCGCCGAGA AGGTCCGCTA CGCCGACTAT TCCATCGATT ACCAGGCCGG GACCATCCTC TTCAAGGAGC CGGTCCCTTC CCTGGACCAG TTCCTGAATC CGGTCACCAT CGTGGTCAAT TACCAGGCCA CAGGCCCCGG GGAGAAGCGC TACGTCTACG GCGGACGCGC CGAGATCCGT TCCGAAAACG GCTCTTATTT CGGCGGCACC GCCGTTGTGG AGGAGCAGGC GCGGCAGGAT AACACCCTCT TTGGCCTGGA CGGAGCCTGG CGCCTGGGCG AGCGCCTGAC CCTGAAGGGG GAGGGGGCGG TCAGCGAGAA CCTGGAACAC GGCAGGGGGA GCGCCTGGAA GACGGAGCTC GCCGCGCGAC CCATCGATCC CCTCCTCTTG AACCTTTACT ACCGGAAGGT GGAGTCCGAC TTCTTCAACA CCTCGATGAC CGGGACGGAG CTTGGGACCG AGAAGTACGG CGGCCGGGCC GACTTCCGCA TCGGCGCGGA CGCCCTCGTC TTCGCGGAGA GCTTCCTGCA CCGCTTTGAA CTGGGGGACC GGAAACTCTT CGCGAACCAG GTTGGGGCGG TGAAGAAATT CTCCCTGCTG CAGCTGGAGG GGGGCGTCAA GGTGGTGCGG GAGGATCTGG ACGGACGGAC CGAGGGATCG GACCTGGTCT ACGCCGGGGT GATCGCGCCT CTCACCAAAA GACTAGATGC GACGCTCAGG CGCGAGCAGC TTCTTTCGGC CTCCGGCGTC GCCGACTACC AGACCAAGAC CTTCGCCAAG CTCGACTATC GTCTGACCGA GCGCACCAAG GCCTTCCTCA CCGAGGAATA TCAGGAGGGG TCTCCCCTCA TCCGCCAGGC CACCCTGTTC GGCCTGGAGA GCAGGCTCTC CGACCGGATG CGGCTCACCA CCGGCTACCA GATGTCGAGC GGCGCCTCCG GGTACTCCCA GCAGAGCAAC GTCGACCTGA ACACGCGGCT CATGGAGCGG GAAGGCTTCA GCCTCGATTC ACGTACCGGC TACCAGATCG AGCACGCTCT GTCGCAGCAA AGGGGGCAGG CCATCGTCGG GTTGAACAGC CGTTACCGCG CCGCCGAGGG GCTGTACCTC AACTCCTCCC TGGAGCGGGT GCAGACGGTG CAGGGAAATA CCGGCACCCG CACCGCTTTC ACCTTGGGGG GGGAGTATCT CCGGGCAAAA GACCTGAAGC TCTCCGGCCG CTACGAGGTG AGGACCGGCC CCGGAGAGAC CGCCTCCCTC TACGGCGCCA ACGCCGCCTT CAAGTTGAGC CCGTCGCTAA CCCTCTTGGG CAAGCTCTCG CTCTGGGACC GGGACGCCGA CGCAGGCGAC GACGTCATCT TCGACGGCTA CCTGGGGAGC GCCTTCCGCC CGCTGGCGGG CCGGCCGCTG CAACTCCTCA CCCTGGCGCG CTACAAGGTG GAGGACAGAC GCTCACTCCC CGGCTCCTTC GAAAGCCGCA GCGTCATCCT ATCGGCCGAG CCGACCTACA GGATCGTGAG CCGCTGGAGC GCCCAGGGGA AATACGCAGG GAAGCTCAGC TGGGCCGACG GGGTGGGGGG AATGATGCAG AGCTACACCG ACCTGGTCCT GGCCGGGCTC TCCTACGACC TGGCGCAGCG CTGGGAGATC GCCGCCTACC TGAAGCTCAT GAACCAGTAC GACGCCGGGA TGCACTCGAT GGGAGCGGTG GGGAGCGCCG GCTACCGCGT CTATAGAAAC GTCGTCCTCT CCGCCGGATA CAACTTCGCC AGGCTCGACG ACCGGGACTT GACCGGCGAA ACCTTCCAGG GGCAGGGGCC CTTCGTCGGC ATCAAGGTCA AGTTCGACGA GGACATGTTC GAGTCGCAGC AGGCGAGGGT GATCCCCATC CCGGTGCCGC CCCCGGCCCC TGTCGCGAAG CTTCCCCCCG CCGCGGCTCC CGAGCCGGTG CCGGCCCTGC TGGTGCGGGC GGAGCGGCTC GACGAGCCGC TTTTTCTCTC CGGGAGCGCG GAACTCTTCA CCCTCCTGGT CAACGGCGAG CGGGCGAAGC TTCCTTCCAC CGAGGTGACG CTGACCCGGG AGCGCCTGGG TTCCCTGGAG CTCAAGGGGG GAAGGTTCCC CGCGCCGCTG GTCTTCCTGG TGAGCGTGGA GCAGCCGGAA CAGGTCGGCT CCTGGACGCT CAAGGTGATG AACCGCGAGG GGGAGGCGCT GCGCACGCTT GAGGGGACCG GGGCGCCGGC AAAGCGGATC CCGTGGCTTG GGGAGACGGA CCGCCGGAAG GTGGAACAGG GGGAGATCTA TCAGTACCAG CTGCAGGTGA CCTACCTGGA CGGCTCGATA TTCAGCACCG GGCGGGAGCT CTTCGGCGTC AACCGCCGCG AGGCGATACT CCTCACCCTC TCCGGCGGCG CTTTCGTCTT CGACCGCTCC GAGCTGACCC TGGAGGCCAA GCGCCTGCTC AAGGGGGCGG CGCGGGTCCT GCGGGCGCGC CCGCGGGAAA AGGTGATCGT GGAGGGGCAC ACCGACGGCA TCGGGAGCGT GGAGTACAAC ATGGCGCTCT CCCAGAGACG GTGCGACGCC GCAGCCGACT ACCTGGTGCG GGAGGAGGGG ATAGCCCGCT CGCGGCTGTT GCGGCGCTGG TACGGGAAAT CCCGCCCCGT GGCCGACAAC GTAACTTCCC CCGGCAGGAG AATGAACCGC CGCGTCGAGC TGAAGGGGGA TTTCAAAGAG CTGCATCCGG TCTCTCCCGA CGACCGTTAC CGGACGAAGC CCTTCGTGTT GATCAACGGC CGCAGCATCC CGGTCGATCC CCTGGGGCGC TTCGACACCA CCCTCCCGGG CCACACCCTC GATCTCGACC TCGAGATGGG GGATTCGCAG GGGCGCTTCC TCGCCACTTC GCTCCCGCTT CCCGATCTGG ACGTGACCGG GCCTGCCGTC GAGACGCTGG TGGGGTACGG TACTGAAGCC TCCGGCGTCA GGGTCGATGC CGACGGCAAG GCCCATTGCA TGCTCTCGGG AACGGTCGAG GGAACCTCCA TGGAACTGGC GGGGAGGAAG GTGCCGCTCG ACGAGGCAGG GCGCTTCACG CTCGATCTCC CGCTTGCCGA GGGGGACCAG GTCCTGGGCG TGGTGCTTAG AAATGGCTCG GGCTGCTCAA AGCTCATGAA CCTGCGGCTC CGCTCGGAGC GCCAGACCTT GCCTCCCGCG CGGGGTGAGC GGTGA
|
Protein sequence | MTIAAGKKRR RWGGLRLIAL AFLLAFPGTA FPLTPAGTVI RHDFTARFAT PALTEVRSNE TQVSVKDLSD PELVPPRTAT TNATVPVDFR HTLTNRGNFA DSFRLKAAAV DAALPENGQA PLFRFYAADG VTPFPTDAGG LQVVGPLEPG ASVELVLRAT PAAGSEGRVA LIAVSATSVL VPARSASLTN QLQVPAGGVS FLKAVSPVGS VLPGTLLTYG ITLGNSGGAP VAGVRVVDPL DPLLEYQPGS AALPAGVPGS VSFDAATRTL VFDIPSLPAG FSGEITFQAR VRDDAPGNAS IVNTAGVSTA QNAAPSLSNS TLNTVLVRLL QVTKAAGSSA ADAGDIVSYT VRVQNVGAGV LNHVTVGDRI PRGFRYLKGA SHVDGFAVSD PVGGGQQLSW DLGSLEPGAF KVLNYRLVVS AEAPVGTAVN WARATGTPAG GGSVVSPPAS ASVKVRSSVL GSKAVIMGRV FHDGNGNGVP DQGERGEAGV RVYLEDGSFV FTDVEGQYSF TGVSAGDHVV KLDRSTLDPG FVSVPYNTAF AGVGWSQFIT VPFGGPARGD FALAGKAEAS APQPAGRTGQ TSPTSPTSPT SPTLLAPPAA EPDAVPRPRL RLTPALVELP ADAKTTVPFK VELLDPADRR VPGSATVTLF LAKGTLLEPD LDPALPGHQI RVTDGLGVFQ VRSGRATGND LLIAKDGNGS AAQSELFFSP YRRDWILVGL GSLTVGGKAV SGNLEKIDKD DRFDEGIFHE ERLAFFTRGK ILGKYLLTAA YDSDKERRDG VFQAIDPEKY YPVYGDATDI GYEAESRGKI YLKLESGRSY LMAGDYRTDL SENEFSRYDR ALNGAKFEVN TKSVTVKGFE SRTEETVTRD DIPGNGTSGH YFLSRRGAFE NSERVRIEVR DRYHTERVIA VAEKVRYADY SIDYQAGTIL FKEPVPSLDQ FLNPVTIVVN YQATGPGEKR YVYGGRAEIR SENGSYFGGT AVVEEQARQD NTLFGLDGAW RLGERLTLKG EGAVSENLEH GRGSAWKTEL AARPIDPLLL NLYYRKVESD FFNTSMTGTE LGTEKYGGRA DFRIGADALV FAESFLHRFE LGDRKLFANQ VGAVKKFSLL QLEGGVKVVR EDLDGRTEGS DLVYAGVIAP LTKRLDATLR REQLLSASGV ADYQTKTFAK LDYRLTERTK AFLTEEYQEG SPLIRQATLF GLESRLSDRM RLTTGYQMSS GASGYSQQSN VDLNTRLMER EGFSLDSRTG YQIEHALSQQ RGQAIVGLNS RYRAAEGLYL NSSLERVQTV QGNTGTRTAF TLGGEYLRAK DLKLSGRYEV RTGPGETASL YGANAAFKLS PSLTLLGKLS LWDRDADAGD DVIFDGYLGS AFRPLAGRPL QLLTLARYKV EDRRSLPGSF ESRSVILSAE PTYRIVSRWS AQGKYAGKLS WADGVGGMMQ SYTDLVLAGL SYDLAQRWEI AAYLKLMNQY DAGMHSMGAV GSAGYRVYRN VVLSAGYNFA RLDDRDLTGE TFQGQGPFVG IKVKFDEDMF ESQQARVIPI PVPPPAPVAK LPPAAAPEPV PALLVRAERL DEPLFLSGSA ELFTLLVNGE RAKLPSTEVT LTRERLGSLE LKGGRFPAPL VFLVSVEQPE QVGSWTLKVM NREGEALRTL EGTGAPAKRI PWLGETDRRK VEQGEIYQYQ LQVTYLDGSI FSTGRELFGV NRREAILLTL SGGAFVFDRS ELTLEAKRLL KGAARVLRAR PREKVIVEGH TDGIGSVEYN MALSQRRCDA AADYLVREEG IARSRLLRRW YGKSRPVADN VTSPGRRMNR RVELKGDFKE LHPVSPDDRY RTKPFVLING RSIPVDPLGR FDTTLPGHTL DLDLEMGDSQ GRFLATSLPL PDLDVTGPAV ETLVGYGTEA SGVRVDADGK AHCMLSGTVE GTSMELAGRK VPLDEAGRFT LDLPLAEGDQ VLGVVLRNGS GCSKLMNLRL RSERQTLPPA RGER
|
| |