Gene GM21_3131 details

Gene Information

Locus tag: GM21_3131
Symbol:
ID: 8138481
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3635941
End bp: 3637503
Gene Length: 1563 bp
Protein Length: 520 aa
Translation table: 11
GC content: 66%
IMG OID: 644870735
Product: hypothetical protein
Protein accession: YP_003022917
Protein GI: 253701728
COG category: [S] Function unknown
COG ID: [COG1808] Predicted membrane protein
TIGRFAM ID: [TIGR00271] uncharacterized hydrophobic domain
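
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3635941, 3637503)
print(length)                     # 1563
print(protein_length_aa(length))  # 520
```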


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 170
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCGGA CGCACCTGAA CTACTACAGG CGGAAACTGA CGGTCTTTCT CGCGGAAAAG 
GCGGACATGG TGCAGCACCG CGAGGTGATC AGGGAGGTCG CCTCCGGGGT CGAGCGGAGC
TGGGTCTACT ACCTGATGCT CGTGGTGGCG GGGCAGATAG CGCTTCTGGG GCTGCTCACC
AACAGCGTCG CCGTGGTGAT CGGCGCCATG CTGATTTCCC CGCTCATGGG GCCGATCATC
TCGTCGAGCC TCGCCTTGAC CATAGGCGAT CTCTCCTTGG CGCGCCGCGC CTTCAAGACC
ATCGCCGTGA GCGTGCTGCT CACCGTGGCG GTGAGCGCGC TGATCAGCCT CGTCTCGCCG
CTGAAGGAGC CGACCGCCGA GATCCTGGCC CGGGTGAGGC CCAACATCCT GGACCTCTTC
GTGGCGGCGC TCTCGGGGGT CGCGGGCGCG GTCGCCCTTT GCACCAAGCG CAACTATGTG
GTCACAGCCA CGGGGGTCGC GGTCGCGACA GCGGTCATCC CCCCCTTGAG CGTGGTGGGG
TACGCGATGG GGACCTGGCA GCCCAAGCTG GCGCTGGGGG GCTTTCTCCT TTTCTTCACC
AACTTCGTCG CCATCGTGCT CGCCTCGGAC CTGGTCTTCT TCACCCTGGG CTTCAGAACC
AGCCTGGCGG AAGAGACTTC CTTCTCGCAC CGCACCAGGA TCGCGGTGAT CGGTGCGGTG
CTGGCCCTGG TGTCGGTCCC CCTGGTCTAT ACCTTGGGTG CGGACGTGGC GAGGCTGAAG
GAGAAGAAGC GGATCGAGCG CATCCTGAAG AGTCACCTGA ACCGCGAGCA GGTCTCCCGC
CTCACCGGCT ACCAGCAGAC GCCGCGAGAC AAGGAGCTAT TGGTGCGGGC CTCGGTCAAC
ACGGTGGCGC TGATCGATAG GCCCGAGCGG CAAAGCATGG AGCAGGAATT GGCACGGGGG
CTGAAGCGCC CGGTGCGCCT GGAACTGGAG CAGGTGATAG TCGCCTCCGG GCGGGAACTG
GCGCCCGTCG AAGGGAAGCG GGAGCCTCTC CCGGTGAGCC GGGGACAGCT TTCAGCGGAA
GTTGGGGCCA TGGTGGCAAG CGCCGAGCAG GAACTGTCGA GGGCGCTCGA GCCTTTTCCG
GTGAGCCGGA CCAAGGTCAC CTTCGCCGCG CCGGGAGAGC CTTTGCTGAT CACGGCGACC
CTTAGGCGCG ACTACCCTTT GAGCCGCGAC GAGCTGCAGA TCCTGTCGCG GGAGCTGGCG
CGGGTGCTGG AGCTCCCGGT GGAGCTGAAG GTCGAAGCTA AGCCGCTTCT GCCGCAGCTG
ACCTTCACCG CCGACGGCGA ACCGGTACCG CAGACCCAGC AGGCTCTGGA AATCGTCAAG
AGTCTGCCGG AAGGGCCCGC CTCCTTCCGC TTCCGCCTCT CCGCGCCCCC CGACCGGCGC
CGGGAGGCGC TCTCCCTCAA GGGGTACCTG ACCGGGAAGC TCGCCGTACC CGAGTCGGTG
CTGTCCGTTT CGACGCAGAC GCAGAAAAAG CACGGCGTTA CCTTGAGCGT CGTGCGGCAG
TAA
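
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs to be joined before any analysis. As a minimal sketch, the helpers below (hypothetical names) strip the layout whitespace and compute the G+C fraction of whatever sequence is passed in. The example uses only the first two blocks of the listing; the record's own 66% figure refers to the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks used for layout; upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a sequence (layout whitespace ignored)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


first_blocks = "ATGATCCGGA CGCACCTGAA"
print(clean_sequence(first_blocks))  # ATGATCCGGACGCACCTGAA
print(gc_fraction(first_blocks))     # 0.55
```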
 
Protein sequence
MIRTHLNYYR RKLTVFLAEK ADMVQHREVI REVASGVERS WVYYLMLVVA GQIALLGLLT 
NSVAVVIGAM LISPLMGPII SSSLALTIGD LSLARRAFKT IAVSVLLTVA VSALISLVSP
LKEPTAEILA RVRPNILDLF VAALSGVAGA VALCTKRNYV VTATGVAVAT AVIPPLSVVG
YAMGTWQPKL ALGGFLLFFT NFVAIVLASD LVFFTLGFRT SLAEETSFSH RTRIAVIGAV
LALVSVPLVY TLGADVARLK EKKRIERILK SHLNREQVSR LTGYQQTPRD KELLVRASVN
TVALIDRPER QSMEQELARG LKRPVRLELE QVIVASGREL APVEGKREPL PVSRGQLSAE
VGAMVASAEQ ELSRALEPFP VSRTKVTFAA PGEPLLITAT LRRDYPLSRD ELQILSRELA
RVLELPVELK VEAKPLLPQL TFTADGEPVP QTQQALEIVK SLPEGPASFR FRLSAPPDRR
REALSLKGYL TGKLAVPESV LSVSTQTQKK HGVTLSVVRQ
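
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration under stated assumptions, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and it does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop layout whitespace
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGATCCGGA CGCACCTGAA CTACTACAGG"))  # MIRTHLNYYR
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein (after removing its layout spaces) gives a quick integrity check on the record.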