Gene GM21_1087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1087 
Symbol 
ID8136409 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1273686 
End bp1275479 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content64% 
IMG OID644868698 
Productsurface antigen (D15) 
Protein accessionYP_003020906 
Protein GI253699717 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0729] Outer membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0000000000374236 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGCTGACAC TGATAAAGGT CCGCCCGATG CCCAAACGGT TTCCTCCATA CCTAGCTCTT 
ATCTGCCTCG CGGCGCTGCT TCTGCGCGCG CCTGCCGCCT TCGCGGCCGA TCCGGTCGAG
ATCGCGGTGA CGGGTGTGGA AGGCGATCCC CTGGAGAACG TGAGGCAGGC GCTGGCGCTT
CCCTACGGCC TGGTCCGGGA GGGGAAAGTG GACCGGCTCT GGCTGGACCG CTTCGCCAAA
AAGGCGCCGG ACAAGGTACG TCAGGCGCTT GAGCCTTACG GTTACTACAA GTCGGAGGTT
TCGGCCCGGG TGGCTCGGAC ACGGGACGGC AAACCGAGCC TGCAGGTCGT CGTCATCCCG
GGAGAACCGG TGCTGGTCAC CGAGGTGACG GTGGAGTTGA AGGGGGCGGG AGCTAAGGCG
AGAAGGCTTT CGCGGGTGCG GGACGCCTTT CCGGTGAGGA GGGGGGAGGT GCTGCTGCAA
CCGGATTACG AGCGGGGCAA GGGGGCGCTG CAGTCGCAGG CCCGGAAGCT TGGCTACCTC
GACGCCTCCT TCCCCCGCCA CGAGATCCGC ATCAGTGAAG ACCGCACGAA AGCCAGGATC
GCCCTGGTGC TGGACACCGG CCCCCGTTTC TATTTCGGCC CGGCGGTCAT TCAGGGAGCG
CCCGATTACC CCGAGTCCTA CCTGAAGCGG TTCGTCGCCT TCAAGGAAGG GGAACCTTTT
TCCTACGCGA AGATCGGGGA GACGCAGCTC AATTTCGCCA ACTCCGAGCG CTTCAGGCAG
GTGGTGGTCA CGCCGGAGCG CGAGCGGGCC GAGAACGCCC GGGTACCGGT CGTGGTGCAG
CTTACCGAGG CCCCGCGCCG CACGGTAAGG CCCGGGATCG GCTACGGGAC CGACACCGGC
GCCAGGTTCT CCACCCACTA CCGGGACTTG AACCTCTTCC ACAAAGGGCA CGACCTCGAC
TTGAGCCTCT ACGCGGCGCA ACGCCTGCAG GGATTCGCCG GGCGCTACAC CATTCCGAGC
AGCAGCGATT ACCGAAGTTC CACCGCCTTG CAATTGAACC TGCAAAAGGA AGACGTCACC
AATTACTTGA GCAAGATCGT CGCTTTGGAG CTGGACCGAA ACATTGGCCT TGGGCGCGGC
GAGCTGGCGA CGGCGTACGT GAGGCTGCTG CAGGAGGTCT TCACCATCGG CGACGAGAAC
GCGAACTCCA GGCTGGTGCT TCCTGGATTC CGCTTCTCCA AGGAAACCTT CAACAACATG
GTCCGCCCCA GGCGCGGTTA CGCCTACACC TTGGAGCTGC GCGGCGCTCA CCCGTATCTG
GGATCCGACA CCGGCCTCAT CCAGGGGATC GCCCATGCCA ACCTGCTGTT CCCGCTACCC
TGGCGCCTCT CCCTGCAAAG CCGGGGGGAC GCGGCGTACA GCCTGCTCGA TGACCCCTTC
TCGGAACTTC CCCCCTCCAT CCGCTTCTTT GCCGGCGGCG ACCAGAGCGT GCGCGGCTAT
TCCTACCAGA GCTTGGGCCC CAGCGACTCC TCCGGGAAGG TGGTGGGGGG GAGGCACCTG
CTGGTGGGGA GCCTGGAGCT TTTGCGCGCC CTCTACAAGG ACTGGGGGGT GTCGGTCTTC
TACGACATAG GGAACGCCTT CAACAACTAC GCGGACATGC GCCTGAAAGA CGGGACCGGC
GTGGGCATCC ATTACTACAC CGCGGTCGGG GGGCTGAACC TCTACCTCGC CAAGCCGCTT
GCCACCGGTG CGGGGAGCTA TCGCATCCAT TTCACCGTGG GGTTCCAGCT ATGA
 
Protein sequence
MLTLIKVRPM PKRFPPYLAL ICLAALLLRA PAAFAADPVE IAVTGVEGDP LENVRQALAL 
PYGLVREGKV DRLWLDRFAK KAPDKVRQAL EPYGYYKSEV SARVARTRDG KPSLQVVVIP
GEPVLVTEVT VELKGAGAKA RRLSRVRDAF PVRRGEVLLQ PDYERGKGAL QSQARKLGYL
DASFPRHEIR ISEDRTKARI ALVLDTGPRF YFGPAVIQGA PDYPESYLKR FVAFKEGEPF
SYAKIGETQL NFANSERFRQ VVVTPERERA ENARVPVVVQ LTEAPRRTVR PGIGYGTDTG
ARFSTHYRDL NLFHKGHDLD LSLYAAQRLQ GFAGRYTIPS SSDYRSSTAL QLNLQKEDVT
NYLSKIVALE LDRNIGLGRG ELATAYVRLL QEVFTIGDEN ANSRLVLPGF RFSKETFNNM
VRPRRGYAYT LELRGAHPYL GSDTGLIQGI AHANLLFPLP WRLSLQSRGD AAYSLLDDPF
SELPPSIRFF AGGDQSVRGY SYQSLGPSDS SGKVVGGRHL LVGSLELLRA LYKDWGVSVF
YDIGNAFNNY ADMRLKDGTG VGIHYYTAVG GLNLYLAKPL ATGAGSYRIH FTVGFQL