Gene Mext_0404 details


Gene Information

Locus tag: Mext_0404
Symbol:
ID: 5834203
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 447638
End bp: 448849
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 73%
IMG OID: 641366188
Product: extracellular ligand-binding receptor
Protein accession: YP_001637897
Protein GI: 163849854
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
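
The start and end coordinates, gene length, and protein length in this record are consistent with one another. A minimal sketch of that arithmetic follows. It assumes inclusive, 1-based coordinates and a CDS that ends in a single stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(447638, 448849)
print(length)                  # 1212
print(protein_length(length))  # 403
```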


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.697779
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.706576
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCGAC GCGGCGGACG GTGGACGGCC CGCGCGGGTG CCGCGCTTCT CGGCCTCGGG 
CTCACGATCG GGCTCGCGGG CTGCCTCGGC GTTCCGGATG CGCCACGGCC GGCGGCTCGG
GCGGTGCCCG AGGGTGAGCC CAGCCCCACC ATGACCGCCG GCAACGTCAT TGGCGCGGGC
ACGGTGAAAG TCGCTCTGAT CGTCCCGCTG AGCGGGCAGG GGGCGGCGGT CGGCGCCGCC
CTGCGCAACG CTGCCGAACT CGCCTACGAC GATTTCCAGA AGCCGAACCT CCAGATCCTC
GTGAAGGACG ACCGCGGCAC GCCGGAGGGC GCACGCGAGG CGACGCAAGC GGCCTTCGCC
GAGGGCGCCG AAATGGTGCT CGGCCCGCTC TTCGCCGCCA ATGTCCAGGT GGCCGGGGGT
GTCGCCCGCG GAGCGGGCAA GCCGGTCATC GCCTTCTCGA CCGACGCGGC GGTGGCCGCG
CGCGGCGTCT ACCTCATCAG CTTCCTGCCG CAATCGGAGG TGGACCGCAT CGTCGATGAG
GCGAGCGCGG GCGGCCGCCG CTCCTTCGCC GCGCTGATCC CCGAGACGGT CTACGGCAAT
GCCGTCGAGG CGCAGTTTCG TGAGGCGGTG GCCCGGCGCG GCGCCCGGCT CGTCGGCATC
GAGCGCTACC CGGCCGGCAA TCCCGGCCCG GCGGTCGACC GATTGCGCGG CGTGATCGCG
GGCGGCGGCG CCCAGGCCGA CGCGCTGTTC GTGCCCGATA CCGCCGAGGG CCTGATGGCC
GTCGCGCCTG CCCTGACCAA GGTCGGGTTC TCGCCCGCGC GGGTCCGTCC TCTCGGGCTC
GCTCTGTGGA ACGACCCGCG GGTGCTGTCG CAGGCGGCTT TCCAGGGCGG CCGCTTCGCC
GCGCCCGACG CTGCCGGCTT TGCCGGTTTC GCCCAGCGCT ATCAGACCCG CTTCGGGACG
ATGCCGCCGC GCACCGCCTC GCTCGGCTAC GATGCGGTCT CGCTCACGGC CGCATTGGTG
CGCCAATACG GCTCGCAGCG CTTCGCCGAC GCGACGCTGA CCAATCCGGC GGGCTTCTCC
GGCCTCGACG GCACCTTCCG CTTCCTGCCC GAAGGCGTCA GCGAGCGGAC CTTGGCGGTC
TACGAGATCC GCAACAACGC GGCGAACGTG GTGAGCCCGG CCCCGAAGGT GCTTGCGCCG
TCGGGGATTT GA
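
The GC content reported for this gene can be recomputed from the sequence. The sketch below is one way to do so for a sequence printed in space-separated blocks of ten, as above. The helper name `gc_fraction` is hypothetical, and only the first row of the sequence is used here for brevity, so the printed value describes that row and not the whole gene.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a DNA string, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First row of the gene sequence, copied with its block spacing intact.
first_row = "ATGGCGCGAC GCGGCGGACG GTGGACGGCC CGCGCGGGTG CCGCGCTTCT CGGCCTCGGG"
print(f"{gc_fraction(first_row):.1%}")
```

Passing the full 1212 bp sequence to the same function gives the value for the whole gene.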
 
Protein sequence
MARRGGRWTA RAGAALLGLG LTIGLAGCLG VPDAPRPAAR AVPEGEPSPT MTAGNVIGAG 
TVKVALIVPL SGQGAAVGAA LRNAAELAYD DFQKPNLQIL VKDDRGTPEG AREATQAAFA
EGAEMVLGPL FAANVQVAGG VARGAGKPVI AFSTDAAVAA RGVYLISFLP QSEVDRIVDE
ASAGGRRSFA ALIPETVYGN AVEAQFREAV ARRGARLVGI ERYPAGNPGP AVDRLRGVIA
GGGAQADALF VPDTAEGLMA VAPALTKVGF SPARVRPLGL ALWNDPRVLS QAAFQGGRFA
APDAAGFAGF AQRYQTRFGT MPPRTASLGY DAVSLTAALV RQYGSQRFAD ATLTNPAGFS
GLDGTFRFLP EGVSERTLAV YEIRNNAANV VSPAPKVLAP SGI
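
The protein sequence corresponds to the gene sequence read in frame from its first base. The sketch below shows that mapping on the first and last blocks of the gene. It builds the codon table from the standard NCBI amino-acid string. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons are permitted. Because this gene begins with ATG, no special start-codon handling is included, which is a simplifying assumption. The name `translate` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCGCGAC GCGGCGGACG GTGGACGGCC"))  # MARRGGRWTA
print(translate("TCGGGGATTT GA"))                     # SGI
```

The two outputs match the first ten and last three residues of the protein sequence listed above.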