Gene GM21_3832 details

Gene Information

Locus tag: GM21_3832
Symbol:
ID: 8139206
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4415927
End bp: 4417993
Gene Length: 2067 bp
Protein Length: 688 aa
Translation table: 11
GC content: 69%
IMG OID: 644871449
Product: TPR repeat-containing protein
Protein accession: YP_003023607
Protein GI: 253702418
COG category: [R] General function prediction only
COG ID: [COG0457] FOG: TPR repeat
TIGRFAM ID:
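
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates (a convention the record does not state) and an unspliced CDS whose final codon is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4415927, 4417993)
print(length)                  # 2067
print(protein_length(length))  # 688
```

Both values match the Gene Length and Protein Length fields of this record.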


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value: 0.00173771
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAGGTGG CGCTCGTCTG GCGCGGCAAC CCGGCCCACG GCAACGACGC CAACCGCTCG 
ATACCGCTCG CCAAACTGGC GCCGCTTGCT GAAGTCCCCG GCGTGAGCTT CTATTCCCTG
CAGGTGGGAG AGGCCGCGCG GGCGGAGCTT CCGGGCCCCT TCCCGATCGT CGACCTGGCG
CCTCACATCG AGGATTTCGC CGACACGGCG GCGCTTGCCG GAGAGCTCGA CCTGGTCGTC
TGCGTCGACA CCTCGGTGGC GCACCTGTGC GGCGCGCTCG GGGTGCCGGT CTGGCTCCTG
GTCCCCCTGG TCCCGGACTG GCGCTGGGGC CTTTACCGCG ACGACTCCCC CTGGTACCCG
ACCCTGCGCC TTTTCCGCCA AAGGCAGGCG GGAGAATGGG AAGAGGTGGT GGCCCGGGTC
AAGGAGGCGC TGGCGCGGGA GGCGTCCCCC GCTTCCGTTT CCCCGCAGGC GCTCCACGCC
GATGCCCGGG GAGAGGCCGA GATCTGGAAC AACCGGGGGT GCGCCGAGGC CGCCGCCGGC
AGGCACCTGG AAGCGGTGGA GAGCTACCGC GAGGCGATCG CCCTCGCCCC CGATCTCATG
CCCGCCCATT ACAACCTCGG CAACAGCCTC TACGCCCTGG GAAGAATCGC CGAGGCGGCC
GAAAGCTACC GCTGGGCCCT GGCTCTCGAT CCGGCGCTGC CGCAGGGGTG GCACAACCTC
TCTCTCGCCC TCAAGGCGCA GGGGGCGCTG GACGAGGCGC TGCATGCGCT GAGAAGGGCG
CTGCGGATCG CCCCGGATTA CCTGGAGGCC AGGCACACCC AGGGCGAGCT GCACCACGAG
CGCGGCGAGC TGGATGAGGC GCAGGCTTGT TTCCGCGAGA ACCTCTCCCG CGATCCCGGC
TACCTCCCTT CGTGGAACGC ACTGGGAATC TCGCTGCAGC TGCAGGGGCT TTTGGAGGAG
GCGGTGGATT GCTACCAAAA GGCGCTCGCC CTGAAGCCGG ATTACCTGCA CGCCCTGAAC
AACCTGGGGA CGGCGAGCCG CTCGCTCGGG CTTTTGGAGC AGGCGAAAGC GTGCTACCTG
AGGGTGCTGG AGATCGACGA AGGGTACGCC GACGCCCGCT GGAACCTGGC CCTGGTGCAG
CTGCAACTGG GAGAGTACCG GGAGGGGTGG CAGGGATACG AGTGCCGCTT CAGCAACGTC
GACCCCATTC CCAGGCTGGA ATTTCCCCGG CCGCTTTGGG CCGGCGGGAG CCTTGAAGGG
CGTACCATAC TCCTCACCAG CGAGCAGGGT TTCGGCGACA CCTTCCAATT CGTGCGCTAC
GCGAAGCTCC TGGCACAGGG GGGCGCGACG GTGCTGGTGC AGGCGCAGAG CGAGGCGATC
GCGCCGGTGA TCGCGACCGT ACCCGGCGTC GCCCGCGTGC TGGTCCGGGG GGAGCCGCTC
CCCGAGTTCG ACTGCCACGC TCCCCTGATG AGCCTGCCTT ATCTGTGCGG GACGGAGCTC
GCCTGCATTC CCGCCGAGAT ACCGTACCTC TTCGCCGACC CGGCGCTGGT TAAGAAGTGG
AGCCCGCTCC TCTCAGGTGA GAGGCTCCGG GTGGGGCTCG TCTGGGCGGG GAGAAAGAGC
TACAAGGACG ACCTGAAGCG CTCGCTGTCG CTGCCGCTCT TCGAGCCGCT CTCGAAGGTG
GCCGGCGCCG ATTTCTTCGC CCTCCAGGTG GGAGACGGGG CGGAGCAGGC GGCCATCCCG
CCGCCGGGGG TGAAGCTCAC CGACCTTGGG TGCAACATCA GGAACTTCGC CGATACCGCC
GCGGTGCTTA CCCGGCTCGA TCTGGTGATC ACGGCCGACA CCGCGGTGGC CCATCTTGCC
GGGGGGCTCG GGGTGCCGGC CTGGGTCCTG CTACCGGTGG GATGCGACTG GCGCTGGCTC
GCCGAAAGGG AAGACTCCCC CTGGTACCCG GGCGCGCGGC TTTTCCGGCA GACGCGCCGC
GGCGACTGGA ACGAGGTGCT GCAGCGCGTC GCCGACTGTC TGGCGACGAT GGTCCCCAAG
GGGGGGCAAC GGAACAGGGA GAGCTGA
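
The GC content field of this record can be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as shown here, in space-separated blocks of ten bases, and that GC content means the percentage of G and C among all bases. The function name `gc_content` is illustrative.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = cleaned.count("G") + cleaned.count("C")
    return 100.0 * gc / len(cleaned)


# First two ten-base blocks of the gene sequence, used as a small example.
print(gc_content("ATGAAGGTGG CGCTCGTCTG"))  # 60.0
```

Passing the full gene sequence instead of the two-block example gives the value to compare against the GC content field.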
 
Protein sequence
MKVALVWRGN PAHGNDANRS IPLAKLAPLA EVPGVSFYSL QVGEAARAEL PGPFPIVDLA 
PHIEDFADTA ALAGELDLVV CVDTSVAHLC GALGVPVWLL VPLVPDWRWG LYRDDSPWYP
TLRLFRQRQA GEWEEVVARV KEALAREASP ASVSPQALHA DARGEAEIWN NRGCAEAAAG
RHLEAVESYR EAIALAPDLM PAHYNLGNSL YALGRIAEAA ESYRWALALD PALPQGWHNL
SLALKAQGAL DEALHALRRA LRIAPDYLEA RHTQGELHHE RGELDEAQAC FRENLSRDPG
YLPSWNALGI SLQLQGLLEE AVDCYQKALA LKPDYLHALN NLGTASRSLG LLEQAKACYL
RVLEIDEGYA DARWNLALVQ LQLGEYREGW QGYECRFSNV DPIPRLEFPR PLWAGGSLEG
RTILLTSEQG FGDTFQFVRY AKLLAQGGAT VLVQAQSEAI APVIATVPGV ARVLVRGEPL
PEFDCHAPLM SLPYLCGTEL ACIPAEIPYL FADPALVKKW SPLLSGERLR VGLVWAGRKS
YKDDLKRSLS LPLFEPLSKV AGADFFALQV GDGAEQAAIP PPGVKLTDLG CNIRNFADTA
AVLTRLDLVI TADTAVAHLA GGLGVPAWVL LPVGCDWRWL AEREDSPWYP GARLFRQTRR
GDWNEVLQRV ADCLATMVPK GGQRNRES