Gene Mext_2087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2087 
Symbol 
ID5831808 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2329011 
End bp2330150 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content69% 
IMG OID641367885 
ProductHPP family protein? 
Protein accessionYP_001639554 
Protein GI163851511 
COG category[T] Signal transduction mechanisms 
COG ID[COG3448] CBS-domain-containing membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.211615 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.139739 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAATC CCGGTGGCCC GGAACAACGC GCTGAACGAT CGCAAGGATT CCGCCTGTTC 
AGACCGATTC TGGCCGGCGC GACCCTGCGT GAGCGCCTGA TCGCGTGCCT GGGTGCTCTT
GCGGGCATCA CGCTCACCGG CCTGGTCTGC GGCTGGTTCT TCGGAGAAGG CCCCCATATC
CCGCTGATCG TCGCGCCGAT GGGGGCGTCG GCGGTGCTGA TCTTCGCCGT GCCGGCCAGT
CCGCTCGCCC AGCCCTGGTC GGTCATCGGC GGCAACACCA TCTCCGCGTT CATGGGCGTG
CTCGCTGCGC ACCTCATTCC CGATCCTGTC ATTGCGATCG GCGTCGGCGT CTCCCTTGCG
ATCGCGGCGA TGTCGCTGAC CCGGTGTCTT CACCCGCCGG GCGGGGCCGC CGCCTTGACC
GCACTCATCG GCGGCCCGGC CGTCACGTCG GCGGGCTTCC TGTTCCCGCT TTTCCCGGTC
GGCCTGAACT CGGTCATTCT CGTTGCGCTC GGCATCGGCT TCCACAAGCT CTCGCGCCGC
AACTACCCGC ACGTCGCGGT CGCGACGCCG GTGAACACCC ATGGGACGGG GGATTTGCCG
GCCCCGCTCC GGGTCGGCTT CCGGCCTGAA GATGTCGATG CGGCCCTGGT CGCGCTCGAC
GAGACGCTGG ACATCGACCG CGCCGATCTC GACCGGCTTC TCCGGCAGGT CGAACTCCAC
GCACTCGTGC GCGCACGGGG GGATCTGACC TGCGGTGAGG TGATGTCACG CGACGTCGTC
ACCATCGGGC TCGATGGCAG CGCCGAACGG GCACGGGAGC TTCTGCTCGC CCACAACATC
AGGACGCTTC CCGTCATCGA CCGGGCCGGC CGGCTCGCCG GAACGATCGG CCTGCGCGAG
CTGACTCTGC ACGGCGAGGT GGCGCTGGCG CAGGTGATGT CCGAGGCCAG GACGACCGGG
CCGGACGACC CGGTGATCGC GCTGGTGAAC GATCTGACGG ACGGTCACAC CCATGCGGTC
GTCGTCATCG CCGACGACCG GCGCGTGCTG GGGATCATCA CCCAGACCGA TCTGCTCGCG
ACCCTGACGC GCCTGCTCTC CGCCAAGGCG TTCGCGCTGC CCGATCCGGT CTCACCCTAG
 
Protein sequence
MPNPGGPEQR AERSQGFRLF RPILAGATLR ERLIACLGAL AGITLTGLVC GWFFGEGPHI 
PLIVAPMGAS AVLIFAVPAS PLAQPWSVIG GNTISAFMGV LAAHLIPDPV IAIGVGVSLA
IAAMSLTRCL HPPGGAAALT ALIGGPAVTS AGFLFPLFPV GLNSVILVAL GIGFHKLSRR
NYPHVAVATP VNTHGTGDLP APLRVGFRPE DVDAALVALD ETLDIDRADL DRLLRQVELH
ALVRARGDLT CGEVMSRDVV TIGLDGSAER ARELLLAHNI RTLPVIDRAG RLAGTIGLRE
LTLHGEVALA QVMSEARTTG PDDPVIALVN DLTDGHTHAV VVIADDRRVL GIITQTDLLA
TLTRLLSAKA FALPDPVSP