Gene GM21_0898 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0898 
Symbol 
ID8136219 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1072033 
End bp1074087 
Gene Length2055 bp 
Protein Length684 aa 
Translation table11 
GC content64% 
IMG OID644868514 
ProductOrganic solvent tolerance protein 
Protein accessionYP_003020723 
Protein GI253699534 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1452] Organic solvent tolerance protein OstA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value6.33463e-30 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAGCTG CAAGAGCCGC CTGGCTTCTC ACATACCTTC TGCTGTCGGC GGTACCCGCC 
CAGGGCGAAC CCGCGGTCCC CGTCGATAAA GAGATCACCC TGAAAGCGGA CGACCTCTCG
GTGGACGTTC CGACTCAGAG CTACCGTGCC CAGGGCGAGG TCCAAATCAC CCAGGACGGC
CTCTCCCTTC TGGCCGACAG CGTGGTCTAT CGCCGGCTCA CCGGCGAGGC CCAGGCGCAG
GGGGGCGTAC TCCTTGAGCG CAGCGGCGAC ACCATGAAGG GGGACAGCCT CTCTTTGAAC
CTGCTCTCCC AGACAGGGGA ACTCCTAAAC GGCGAGCTTT TCGTCAAGAG GTCGAACTTC
CGGTTGCGCG CCGAGCGCCT GGAGAAGACC GGCCCCGCCG ACTACAAGAT GACCAAGGGA
ACCTTCACCA CCTGCGACGG CGACAAGCCC AGCTGGAGGT TCGAGGCGAG GCAGGTGAAG
GTGACCCTGG AGGAGTTCGC CACGGCCAAA GACGCCGTCT TCTACGTCGG CGACGTCCCC
ATCTTCTATA CCCCGTACCT CATCTTCCCC GCCAACATCG AACGGCAGTC GGGATTGCTG
CTCCCGAGGC TCGGTTACTC TTCCAAGAAG GGGTTCTACT ACGACCAGCC TTACTACTGG
GCCATCAATC CGAGCCAGGA GGCGACCTTC AACCTCAACC TGGAAAGCTC CCGGGGAGTC
GGGGGCGGTG TGGACTACCG CTACCTGCGT CCGCACGGCA GCTCAGGGAG GCTGCAGACC
TTCGGCATCT ACGACACCCA GAAGTCGGAG TTCCGCGGCG AGGTGGACCA GCGGCACCTG
GAGCTTCTCA CCCCCAGGCT CACCCTCGCC TCCAATATCC ATCTCATCAC CGACCGCCGC
TATTTCCTGG ATTACGGCGA GCTCTCCGGC GAGTACAACC GGCAGTACCT GGAGTCGACG
CTCTCCTTCG ACCAGCGCTG GGAGCGCAGC AGCCTGTTCG GCGAGCTGCG CTACACCGAC
GACCTGGAGG CCCCCAACAA CGACGCCACC TTGCAGCGGC TCCCCACGCT CGGTTTCATC
GCCGCGGGCG AGAAGGTGGG GCCCGCTTTC TTCTCCATGG ATAGCCGCTT CACCAACTTC
CAGCGGGAGG CTGGAGCCAC CGGGCAGCGC CTGCAGCTGC ATCCCCGGCT CGCCTGGTAC
GGCAAACCCG CCGGCATTTT GGACCTTTCC CTTTACGGCG GTTACCAGCA GCGTATGTAC
AGCGCCAAGG GGGAGATCGG CGAGAGTGGT TGGCGGCAAC TGGGGCAGGC GGACGCAGGG
GGCGCGCTCT CTTTGCCGCT GGAGCGCGTG TACGACGGCC GGCTGCGGCA TCTGATGATC
CCGGCGGTCG AGTACAGCTT CGTACAGCAA CGGCGCGACG AAGACCTCCC GTTTTTCGAT
TACGACGACC GCGTGCTGGG GCAAAATGCC GTCCGCTGGT CGCTCAGCAA CGTGGTGACC
CGGAAGTTCG CCGAAGCGGA CGGAATACCC GAGTACCGTG ACCTCCTCTA CCTGAAGCTC
TCCCAGGGGT ACTGGCTTTC GGGGCAGCGC CGCGACCTTC TCACCCTGGT GGACGAGGGG
CACCGGCTCA CGGACCTGAT GCTGGAGGGT GTGCTCACCC CGGTGCAACG GCTCTCCGTG
GCGCTGGACA CACGCTACAA CACGACCGAC AGCAGGTTTT CCACCGCGAA CGTCGGGGTG
GAGCTGAAGG GAGAGGGGCG CGACAAGGCG AAACTCGGCT ACCGCCACAG CCGCGGGGAA
ATCGACTACG TCGAGGGGGG CTTCACCTTC CCGATTACCA AGGACGTCAC CGCCGATCTG
CTGGGGCGCT ATTCCGCCGA CAGGGGGGAG TTCCTGGAAT CCCGCTACGC GGTCGAGTAC
CGGCGCCAGT GCTGGAGCGT CATCTTCACC TACTCCGACC GGGTCGGCAG CCGCAACGTA
GCAGGCGAGC AGCAGTTCAG CGTCAACTTC TCGCTGGCGG GGCTCGGTTC GCTGGGGCAG
TTGCGGGCGT TTTAA
 
Protein sequence
MKAARAAWLL TYLLLSAVPA QGEPAVPVDK EITLKADDLS VDVPTQSYRA QGEVQITQDG 
LSLLADSVVY RRLTGEAQAQ GGVLLERSGD TMKGDSLSLN LLSQTGELLN GELFVKRSNF
RLRAERLEKT GPADYKMTKG TFTTCDGDKP SWRFEARQVK VTLEEFATAK DAVFYVGDVP
IFYTPYLIFP ANIERQSGLL LPRLGYSSKK GFYYDQPYYW AINPSQEATF NLNLESSRGV
GGGVDYRYLR PHGSSGRLQT FGIYDTQKSE FRGEVDQRHL ELLTPRLTLA SNIHLITDRR
YFLDYGELSG EYNRQYLEST LSFDQRWERS SLFGELRYTD DLEAPNNDAT LQRLPTLGFI
AAGEKVGPAF FSMDSRFTNF QREAGATGQR LQLHPRLAWY GKPAGILDLS LYGGYQQRMY
SAKGEIGESG WRQLGQADAG GALSLPLERV YDGRLRHLMI PAVEYSFVQQ RRDEDLPFFD
YDDRVLGQNA VRWSLSNVVT RKFAEADGIP EYRDLLYLKL SQGYWLSGQR RDLLTLVDEG
HRLTDLMLEG VLTPVQRLSV ALDTRYNTTD SRFSTANVGV ELKGEGRDKA KLGYRHSRGE
IDYVEGGFTF PITKDVTADL LGRYSADRGE FLESRYAVEY RRQCWSVIFT YSDRVGSRNV
AGEQQFSVNF SLAGLGSLGQ LRAF