Gene Mext_1200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1200 
Symbol 
ID5831506 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp1323881 
End bp1325122 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content64% 
IMG OID641366993 
Producturea ABC transporter, urea binding protein 
Protein accessionYP_001638673 
Protein GI163850630 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID[TIGR03407] urea ABC transporter, urea binding protein 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.12535 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTCGC GTGACCGAAA CGACGCCCAC TTCCATCTCC GCCGCCGACT TCTGCTCGGC 
CTTGCCGCCT CTCCGCTGCT AAGTGCCCTG CCCGGACGTG CCCTTGCGCA GACCGGCGCG
GGCGCGCTCG CCGTCACCGA CAAGGAGGTG ACGATTGGCA TCCTGCACTC GATCAGCGGC
ACCATGGCGA TGTCCGAGAC CGGGGCGACG CAAGGGGAAA GGCTCGCCAT CGAACAGATC
AATGCGAGCG GCGGCATCCT CGGCCGCACG GTGAAGGTGA TCCAGGAGGA CGGCGCCTCC
GACTGGCCGA CCTTTGCCGA GAAGGCGCGC AAGCTCGTCG TCAACGACCA TTGCGCGGCG
GTGTTCGGCT GCGTGACCTC GGCCTCGCGC AAGGCGGTGC TGCCGGTCTT CGAGCAGTAT
AACGGCCTCC TGTACTATCC GACCTATTAC GAGGGTCTGG AGCAGTCCAA GAACGTCATC
TACACCGGCC AGGAGGCGAC CCAGCAGACG CTCGTCGCCC TCGACTGGGT GACGAAGGAG
AAGGGCGCCA AGTCCTTCTT CATGGTCGGC TCGGACTATA TCTGGCCGCG CACCACCAAC
AAGATTGCGA CCAAGCACAT CACCAACGTG ACCAAGGGCA CGATCGTCGG CGAGGAATAC
TTCCCCCTCG GCCACACGCA GTTCAACTCG GTCATCAACA AGATCAAGCT CAAGAAGCCG
GACGTCATCT TCGCCACCGT CGTCGGCGGC TCGAACGTCG CCTTCTACAA GCAGCTCAAG
GCGGCCGGCA TCGACCTCAA GAAGCAGACG CTGGTGACGG TGTCCGTGAC CGAGGACGAC
GTCGACGGCA TCGGCGGCGA GAACATCGCC GACGCCTATA GCTGCATGAA GTACTTCCAG
TCGGTCAAGA CTCCGGCCAA CGAGGCCTTC GTCGCCGCCT TCAAGAAGCG CTGGGGCGAC
AAGACCGTCA TCGGCGACAT CACTCAGGCC GCCTATCTCA GCCCCTTCCT GTGGAAGGCG
GCGGTGGAGA AGGCCGGTTC CTTCGAGGTC GACAAGGTGA TCGCCGTTTC ACCGGGCCTC
GAGATCAAGG ACGCACCGGA AGGCGCCGTG AAGATCCACG AGAACCATCA CCTCTGGGCC
AAGACCCGCG TCGCCCGCGC CCGGCCGGAC GGGCAGTTCG ACGTGGTCTA CGAGAGCCCG
GAGCTGATCG AGCCGAACCC GTTCCCGAAG GGGTATCAGT AG
 
Protein sequence
MRSRDRNDAH FHLRRRLLLG LAASPLLSAL PGRALAQTGA GALAVTDKEV TIGILHSISG 
TMAMSETGAT QGERLAIEQI NASGGILGRT VKVIQEDGAS DWPTFAEKAR KLVVNDHCAA
VFGCVTSASR KAVLPVFEQY NGLLYYPTYY EGLEQSKNVI YTGQEATQQT LVALDWVTKE
KGAKSFFMVG SDYIWPRTTN KIATKHITNV TKGTIVGEEY FPLGHTQFNS VINKIKLKKP
DVIFATVVGG SNVAFYKQLK AAGIDLKKQT LVTVSVTEDD VDGIGGENIA DAYSCMKYFQ
SVKTPANEAF VAAFKKRWGD KTVIGDITQA AYLSPFLWKA AVEKAGSFEV DKVIAVSPGL
EIKDAPEGAV KIHENHHLWA KTRVARARPD GQFDVVYESP ELIEPNPFPK GYQ