Gene GM21_1522 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1522 
Symbol 
ID8136851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1778492 
End bp1784446 
Gene Length5955 bp 
Protein Length1984 aa 
Translation table11 
GC content67% 
IMG OID644869134 
Productconserved repeat domain protein 
Protein accessionYP_003021336 
Protein GI253700147 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2885] Outer membrane protein and related peptidoglycan-associated (lipo)proteins 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.00000000000121408 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGATCG CTGCAGGCAA GAAGAGGAGG AGGTGGGGGG GGCTGCGCCT GATCGCGCTC 
GCGTTCCTTC TTGCCTTTCC CGGTACTGCA TTTCCGCTTA CCCCGGCAGG TACCGTCATC
AGGCACGATT TCACCGCACG CTTCGCCACC CCCGCTCTCA CGGAGGTCCG CTCCAACGAA
ACCCAGGTGA GCGTGAAGGA CCTGTCCGAC CCGGAGCTGG TACCCCCGCG TACCGCCACC
ACGAACGCCA CAGTTCCCGT CGATTTCCGT CACACCCTCA CCAACCGGGG GAACTTTGCC
GACAGTTTCA GGTTGAAGGC CGCGGCGGTC GACGCGGCCC TGCCGGAGAA CGGACAGGCG
CCGCTGTTCC GCTTTTACGC CGCTGACGGC GTGACCCCGT TTCCCACCGA TGCCGGCGGA
TTGCAGGTAG TAGGCCCCCT TGAGCCCGGC GCTTCCGTCG AGCTGGTGTT GAGGGCCACG
CCGGCTGCAG GGAGCGAGGG GCGGGTGGCG TTGATCGCGG TCAGCGCCAC CTCCGTGCTG
GTGCCGGCGC GAAGCGCGAG CCTGACCAAC CAGCTACAGG TGCCGGCCGG AGGTGTTTCC
TTCCTCAAGG CCGTCTCCCC TGTGGGGAGC GTACTTCCCG GGACCCTCCT GACCTACGGC
ATCACGCTTG GGAACAGCGG GGGAGCGCCC GTGGCCGGTG TGCGGGTGGT CGATCCTTTG
GATCCCCTTC TCGAATACCA GCCTGGGAGT GCGGCGCTTC CTGCCGGAGT TCCCGGAAGC
GTCAGTTTCG ACGCCGCAAC CCGGACGCTC GTCTTCGACA TACCGTCGCT CCCCGCCGGA
TTCAGCGGGG AGATCACCTT CCAGGCCCGC GTACGTGACG ACGCCCCGGG AAATGCCTCC
ATCGTCAACA CCGCCGGCGT CTCCACTGCG CAAAATGCTG CACCCTCCCT CAGCAACAGC
ACCCTCAACA CAGTGCTGGT CAGATTGCTC CAGGTGACCA AGGCTGCGGG AAGCAGCGCT
GCCGACGCCG GCGACATAGT GTCGTACACG GTGCGCGTGC AGAACGTCGG CGCAGGCGTG
CTGAACCATG TCACCGTAGG CGACCGTATT CCCCGCGGTT TCCGCTATCT CAAGGGGGCG
AGCCATGTGG ACGGGTTCGC CGTGTCGGAC CCGGTGGGAG GGGGGCAGCA GTTGAGTTGG
GACCTGGGGA GCCTGGAGCC CGGAGCCTTC AAAGTGCTCA ACTACCGGTT GGTTGTTTCC
GCTGAGGCCC CCGTCGGGAC CGCCGTCAAC TGGGCCCGGG CCACCGGCAC CCCTGCCGGC
GGCGGCAGCG TCGTCTCCCC TCCCGCCAGC GCCTCCGTCA AGGTGAGGTC CTCCGTTTTG
GGGAGCAAGG CCGTCATCAT GGGTAGGGTC TTCCATGACG GCAACGGGAA CGGCGTGCCG
GACCAAGGGG AGCGGGGCGA GGCCGGGGTG CGGGTGTACC TGGAAGACGG TTCCTTCGTC
TTCACCGACG TGGAAGGGCA GTACAGTTTC ACCGGGGTTT CCGCAGGCGA TCACGTCGTC
AAGCTCGACC GGTCCACCCT GGACCCGGGT TTCGTCTCCG TGCCGTACAA CACCGCCTTT
GCCGGGGTGG GGTGGTCGCA GTTCATCACC GTCCCCTTCG GCGGCCCGGC ACGCGGCGAC
TTCGCGCTCG CCGGGAAAGC GGAAGCGTCG GCACCGCAGC CGGCAGGTCG GACAGGTCAG
ACAAGTCCGA CAAGTCCGAC AAGTCCGACA AGTCCGACGC TGTTGGCGCC TCCGGCCGCC
GAACCGGATG CCGTGCCCCG GCCCCGGCTC CGGCTGACCC CTGCTCTGGT GGAGCTTCCG
GCGGACGCCA AAACCACGGT CCCCTTCAAG GTCGAGCTCC TGGACCCCGC CGACAGGCGG
GTCCCCGGCT CCGCCACCGT CACCCTCTTC CTCGCCAAGG GGACCCTGCT GGAACCGGAC
CTGGATCCGG CCCTCCCGGG CCACCAGATC CGCGTCACAG ACGGCCTCGG CGTCTTCCAG
GTTCGCTCCG GCCGCGCCAC CGGCAACGAC CTCCTCATCG CGAAGGACGG CAACGGCAGC
GCCGCCCAGT CAGAGCTCTT CTTCAGCCCA TACCGGCGCG ACTGGATCCT GGTCGGCCTG
GGCTCCCTGA CCGTAGGGGG GAAGGCGGTA AGCGGCAACC TGGAAAAGAT CGACAAGGAC
GACCGCTTCG ACGAGGGAAT CTTCCACGAG GAGCGGCTCG CCTTTTTCAC CCGCGGCAAG
ATCCTCGGCA AGTACCTCCT CACCGCCGCC TACGACTCGG ATAAGGAGCG CCGCGACGGC
GTCTTCCAGG CCATCGACCC GGAAAAGTAC TACCCTGTCT ACGGCGACGC CACCGACATA
GGGTACGAGG CGGAGTCCCG CGGCAAGATC TATCTCAAGC TGGAAAGCGG CCGCTCCTAT
CTCATGGCCG GCGACTACCG CACCGACCTC TCCGAAAACG AATTCTCCCG CTACGACAGG
GCGCTAAACG GCGCCAAGTT CGAGGTCAAC ACCAAATCGG TGACCGTCAA AGGGTTCGAA
AGCCGCACCG AGGAAACGGT GACGCGCGAC GACATCCCCG GCAACGGCAC CTCCGGCCAC
TACTTCCTCT CCAGGCGGGG CGCCTTCGAG AACAGCGAGC GCGTGCGCAT CGAGGTGCGC
GACCGCTACC ACACGGAACG CGTCATCGCC GTCGCCGAGA AGGTCCGCTA CGCCGACTAT
TCCATCGATT ACCAGGCCGG GACCATCCTC TTCAAGGAGC CGGTCCCTTC CCTGGACCAG
TTCCTGAATC CGGTCACCAT CGTGGTCAAT TACCAGGCCA CAGGCCCCGG GGAGAAGCGC
TACGTCTACG GCGGACGCGC CGAGATCCGT TCCGAAAACG GCTCTTATTT CGGCGGCACC
GCCGTTGTGG AGGAGCAGGC GCGGCAGGAT AACACCCTCT TTGGCCTGGA CGGAGCCTGG
CGCCTGGGCG AGCGCCTGAC CCTGAAGGGG GAGGGGGCGG TCAGCGAGAA CCTGGAACAC
GGCAGGGGGA GCGCCTGGAA GACGGAGCTC GCCGCGCGAC CCATCGATCC CCTCCTCTTG
AACCTTTACT ACCGGAAGGT GGAGTCCGAC TTCTTCAACA CCTCGATGAC CGGGACGGAG
CTTGGGACCG AGAAGTACGG CGGCCGGGCC GACTTCCGCA TCGGCGCGGA CGCCCTCGTC
TTCGCGGAGA GCTTCCTGCA CCGCTTTGAA CTGGGGGACC GGAAACTCTT CGCGAACCAG
GTTGGGGCGG TGAAGAAATT CTCCCTGCTG CAGCTGGAGG GGGGCGTCAA GGTGGTGCGG
GAGGATCTGG ACGGACGGAC CGAGGGATCG GACCTGGTCT ACGCCGGGGT GATCGCGCCT
CTCACCAAAA GACTAGATGC GACGCTCAGG CGCGAGCAGC TTCTTTCGGC CTCCGGCGTC
GCCGACTACC AGACCAAGAC CTTCGCCAAG CTCGACTATC GTCTGACCGA GCGCACCAAG
GCCTTCCTCA CCGAGGAATA TCAGGAGGGG TCTCCCCTCA TCCGCCAGGC CACCCTGTTC
GGCCTGGAGA GCAGGCTCTC CGACCGGATG CGGCTCACCA CCGGCTACCA GATGTCGAGC
GGCGCCTCCG GGTACTCCCA GCAGAGCAAC GTCGACCTGA ACACGCGGCT CATGGAGCGG
GAAGGCTTCA GCCTCGATTC ACGTACCGGC TACCAGATCG AGCACGCTCT GTCGCAGCAA
AGGGGGCAGG CCATCGTCGG GTTGAACAGC CGTTACCGCG CCGCCGAGGG GCTGTACCTC
AACTCCTCCC TGGAGCGGGT GCAGACGGTG CAGGGAAATA CCGGCACCCG CACCGCTTTC
ACCTTGGGGG GGGAGTATCT CCGGGCAAAA GACCTGAAGC TCTCCGGCCG CTACGAGGTG
AGGACCGGCC CCGGAGAGAC CGCCTCCCTC TACGGCGCCA ACGCCGCCTT CAAGTTGAGC
CCGTCGCTAA CCCTCTTGGG CAAGCTCTCG CTCTGGGACC GGGACGCCGA CGCAGGCGAC
GACGTCATCT TCGACGGCTA CCTGGGGAGC GCCTTCCGCC CGCTGGCGGG CCGGCCGCTG
CAACTCCTCA CCCTGGCGCG CTACAAGGTG GAGGACAGAC GCTCACTCCC CGGCTCCTTC
GAAAGCCGCA GCGTCATCCT ATCGGCCGAG CCGACCTACA GGATCGTGAG CCGCTGGAGC
GCCCAGGGGA AATACGCAGG GAAGCTCAGC TGGGCCGACG GGGTGGGGGG AATGATGCAG
AGCTACACCG ACCTGGTCCT GGCCGGGCTC TCCTACGACC TGGCGCAGCG CTGGGAGATC
GCCGCCTACC TGAAGCTCAT GAACCAGTAC GACGCCGGGA TGCACTCGAT GGGAGCGGTG
GGGAGCGCCG GCTACCGCGT CTATAGAAAC GTCGTCCTCT CCGCCGGATA CAACTTCGCC
AGGCTCGACG ACCGGGACTT GACCGGCGAA ACCTTCCAGG GGCAGGGGCC CTTCGTCGGC
ATCAAGGTCA AGTTCGACGA GGACATGTTC GAGTCGCAGC AGGCGAGGGT GATCCCCATC
CCGGTGCCGC CCCCGGCCCC TGTCGCGAAG CTTCCCCCCG CCGCGGCTCC CGAGCCGGTG
CCGGCCCTGC TGGTGCGGGC GGAGCGGCTC GACGAGCCGC TTTTTCTCTC CGGGAGCGCG
GAACTCTTCA CCCTCCTGGT CAACGGCGAG CGGGCGAAGC TTCCTTCCAC CGAGGTGACG
CTGACCCGGG AGCGCCTGGG TTCCCTGGAG CTCAAGGGGG GAAGGTTCCC CGCGCCGCTG
GTCTTCCTGG TGAGCGTGGA GCAGCCGGAA CAGGTCGGCT CCTGGACGCT CAAGGTGATG
AACCGCGAGG GGGAGGCGCT GCGCACGCTT GAGGGGACCG GGGCGCCGGC AAAGCGGATC
CCGTGGCTTG GGGAGACGGA CCGCCGGAAG GTGGAACAGG GGGAGATCTA TCAGTACCAG
CTGCAGGTGA CCTACCTGGA CGGCTCGATA TTCAGCACCG GGCGGGAGCT CTTCGGCGTC
AACCGCCGCG AGGCGATACT CCTCACCCTC TCCGGCGGCG CTTTCGTCTT CGACCGCTCC
GAGCTGACCC TGGAGGCCAA GCGCCTGCTC AAGGGGGCGG CGCGGGTCCT GCGGGCGCGC
CCGCGGGAAA AGGTGATCGT GGAGGGGCAC ACCGACGGCA TCGGGAGCGT GGAGTACAAC
ATGGCGCTCT CCCAGAGACG GTGCGACGCC GCAGCCGACT ACCTGGTGCG GGAGGAGGGG
ATAGCCCGCT CGCGGCTGTT GCGGCGCTGG TACGGGAAAT CCCGCCCCGT GGCCGACAAC
GTAACTTCCC CCGGCAGGAG AATGAACCGC CGCGTCGAGC TGAAGGGGGA TTTCAAAGAG
CTGCATCCGG TCTCTCCCGA CGACCGTTAC CGGACGAAGC CCTTCGTGTT GATCAACGGC
CGCAGCATCC CGGTCGATCC CCTGGGGCGC TTCGACACCA CCCTCCCGGG CCACACCCTC
GATCTCGACC TCGAGATGGG GGATTCGCAG GGGCGCTTCC TCGCCACTTC GCTCCCGCTT
CCCGATCTGG ACGTGACCGG GCCTGCCGTC GAGACGCTGG TGGGGTACGG TACTGAAGCC
TCCGGCGTCA GGGTCGATGC CGACGGCAAG GCCCATTGCA TGCTCTCGGG AACGGTCGAG
GGAACCTCCA TGGAACTGGC GGGGAGGAAG GTGCCGCTCG ACGAGGCAGG GCGCTTCACG
CTCGATCTCC CGCTTGCCGA GGGGGACCAG GTCCTGGGCG TGGTGCTTAG AAATGGCTCG
GGCTGCTCAA AGCTCATGAA CCTGCGGCTC CGCTCGGAGC GCCAGACCTT GCCTCCCGCG
CGGGGTGAGC GGTGA
 
Protein sequence
MTIAAGKKRR RWGGLRLIAL AFLLAFPGTA FPLTPAGTVI RHDFTARFAT PALTEVRSNE 
TQVSVKDLSD PELVPPRTAT TNATVPVDFR HTLTNRGNFA DSFRLKAAAV DAALPENGQA
PLFRFYAADG VTPFPTDAGG LQVVGPLEPG ASVELVLRAT PAAGSEGRVA LIAVSATSVL
VPARSASLTN QLQVPAGGVS FLKAVSPVGS VLPGTLLTYG ITLGNSGGAP VAGVRVVDPL
DPLLEYQPGS AALPAGVPGS VSFDAATRTL VFDIPSLPAG FSGEITFQAR VRDDAPGNAS
IVNTAGVSTA QNAAPSLSNS TLNTVLVRLL QVTKAAGSSA ADAGDIVSYT VRVQNVGAGV
LNHVTVGDRI PRGFRYLKGA SHVDGFAVSD PVGGGQQLSW DLGSLEPGAF KVLNYRLVVS
AEAPVGTAVN WARATGTPAG GGSVVSPPAS ASVKVRSSVL GSKAVIMGRV FHDGNGNGVP
DQGERGEAGV RVYLEDGSFV FTDVEGQYSF TGVSAGDHVV KLDRSTLDPG FVSVPYNTAF
AGVGWSQFIT VPFGGPARGD FALAGKAEAS APQPAGRTGQ TSPTSPTSPT SPTLLAPPAA
EPDAVPRPRL RLTPALVELP ADAKTTVPFK VELLDPADRR VPGSATVTLF LAKGTLLEPD
LDPALPGHQI RVTDGLGVFQ VRSGRATGND LLIAKDGNGS AAQSELFFSP YRRDWILVGL
GSLTVGGKAV SGNLEKIDKD DRFDEGIFHE ERLAFFTRGK ILGKYLLTAA YDSDKERRDG
VFQAIDPEKY YPVYGDATDI GYEAESRGKI YLKLESGRSY LMAGDYRTDL SENEFSRYDR
ALNGAKFEVN TKSVTVKGFE SRTEETVTRD DIPGNGTSGH YFLSRRGAFE NSERVRIEVR
DRYHTERVIA VAEKVRYADY SIDYQAGTIL FKEPVPSLDQ FLNPVTIVVN YQATGPGEKR
YVYGGRAEIR SENGSYFGGT AVVEEQARQD NTLFGLDGAW RLGERLTLKG EGAVSENLEH
GRGSAWKTEL AARPIDPLLL NLYYRKVESD FFNTSMTGTE LGTEKYGGRA DFRIGADALV
FAESFLHRFE LGDRKLFANQ VGAVKKFSLL QLEGGVKVVR EDLDGRTEGS DLVYAGVIAP
LTKRLDATLR REQLLSASGV ADYQTKTFAK LDYRLTERTK AFLTEEYQEG SPLIRQATLF
GLESRLSDRM RLTTGYQMSS GASGYSQQSN VDLNTRLMER EGFSLDSRTG YQIEHALSQQ
RGQAIVGLNS RYRAAEGLYL NSSLERVQTV QGNTGTRTAF TLGGEYLRAK DLKLSGRYEV
RTGPGETASL YGANAAFKLS PSLTLLGKLS LWDRDADAGD DVIFDGYLGS AFRPLAGRPL
QLLTLARYKV EDRRSLPGSF ESRSVILSAE PTYRIVSRWS AQGKYAGKLS WADGVGGMMQ
SYTDLVLAGL SYDLAQRWEI AAYLKLMNQY DAGMHSMGAV GSAGYRVYRN VVLSAGYNFA
RLDDRDLTGE TFQGQGPFVG IKVKFDEDMF ESQQARVIPI PVPPPAPVAK LPPAAAPEPV
PALLVRAERL DEPLFLSGSA ELFTLLVNGE RAKLPSTEVT LTRERLGSLE LKGGRFPAPL
VFLVSVEQPE QVGSWTLKVM NREGEALRTL EGTGAPAKRI PWLGETDRRK VEQGEIYQYQ
LQVTYLDGSI FSTGRELFGV NRREAILLTL SGGAFVFDRS ELTLEAKRLL KGAARVLRAR
PREKVIVEGH TDGIGSVEYN MALSQRRCDA AADYLVREEG IARSRLLRRW YGKSRPVADN
VTSPGRRMNR RVELKGDFKE LHPVSPDDRY RTKPFVLING RSIPVDPLGR FDTTLPGHTL
DLDLEMGDSQ GRFLATSLPL PDLDVTGPAV ETLVGYGTEA SGVRVDADGK AHCMLSGTVE
GTSMELAGRK VPLDEAGRFT LDLPLAEGDQ VLGVVLRNGS GCSKLMNLRL RSERQTLPPA
RGER