Gene Mext_3207 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3207 
Symbol 
ID5831499 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp3557647 
End bp3558795 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content72% 
IMG OID641369007 
ProductCBS domain-containing protein 
Protein accessionYP_001640665 
Protein GI163852622 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.403327 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAACG ACCGAAGTCG CGGCGCGGCC CTGGCCGCGC CAGCCGCGGA CGCCGAACCG 
CCCCCACGCG AGGCGTGGTA CGATCGTCTG CTGAACGTCT TCCAAATGCG TCCGCGCGAC
TCCCTGCGCA CCGACATCGA GGAGGCCTTG GCCGAGCCCG ACACGGGTGA GGACGCCTTC
TCTCCCCTTG AGCGCGCCAT GCTCAAGAAC GTGCTCGGGC TGCACAAGGT GCGCGTCGAC
GACGTGATGC TGCCCCGCGC CGACATCGTG GCGGTGGCCA GCGACACCAG CCTCGGCGAT
CTCCTGAAGC TGTTCCGCAC CGCCGGCCAT TCGCGCCTGC CGGTCTACGG CGAAACCCTC
GACGATCCCC GCGGCATGGT CCACATCCGC GACTTCGTGG AATACCTCGC CACCCAGGCG
GAAGCCGCCC CGCGCCGGGC CGCGCCGCAG CCTGTGGTGG CCACCGGCGC CGAGGCCAAG
CCCACGCCGC GCCCGCGCCG CACGGCCTCC GCCCGCGGCG CGCTGCGCAG CCTCGATCTC
GGCAAGGTCG ATCTCACCGC AACCCTCGCC TCCACCCGCA TCCAGCGCCC GGTCCTGTTC
GTGCCGCCCT CCATGCCGGC GATCGACCTG CTGGTGCGGA TGCAGGCCAC GCGCACCCAC
ATGGCGCTGG TCATCGACGA GTATGGCGGC ACCGACGGGC TGATCTCGAT CGAGGATCTG
ATCGAGATGG TCGTCGGCGA CATCGAGGAC GAGCACGACG TGGCGGAGGG CCAGCTCGTC
AACCGCATGG AAGGCGAGAC GGAGGCCTAT ATCGCCGACG CCCGCGCCGG GCTCGCGGAA
GTATCGGCGG CAACCGGCCT CGACCTCGCC GCCGCTTTCG GGGAACTCGC CGAGGAGATC
GACACGATCG GCGGCCTGAT CGTGACGCTG GCCGGCCGGG TTCCGGCGCG CGGCGAGCGG
ATCCCCGGTC CCGACGACAT CGAGTTCGAG GTGCTGGACG CCGATCCCCG GCGGGTGAAG
CGGATCAAGC TCCAGCGCGC GCCGGCCAAG ATCGGCACCG TCGTGCCGCT CGCCCTACCG
CCGCCCCGCC CGGCGGCGCC GCAGGCACCC GACACGGACG CAGCGCAGGC CGCCGAGGCC
GGGCGCTGA
 
Protein sequence
MTNDRSRGAA LAAPAADAEP PPREAWYDRL LNVFQMRPRD SLRTDIEEAL AEPDTGEDAF 
SPLERAMLKN VLGLHKVRVD DVMLPRADIV AVASDTSLGD LLKLFRTAGH SRLPVYGETL
DDPRGMVHIR DFVEYLATQA EAAPRRAAPQ PVVATGAEAK PTPRPRRTAS ARGALRSLDL
GKVDLTATLA STRIQRPVLF VPPSMPAIDL LVRMQATRTH MALVIDEYGG TDGLISIEDL
IEMVVGDIED EHDVAEGQLV NRMEGETEAY IADARAGLAE VSAATGLDLA AAFGELAEEI
DTIGGLIVTL AGRVPARGER IPGPDDIEFE VLDADPRRVK RIKLQRAPAK IGTVVPLALP
PPRPAAPQAP DTDAAQAAEA GR