Gene Mext_3381 details


Gene Information

Locus tag: Mext_3381
Symbol:
ID: 5834959
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 3746599
End bp: 3747852
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 64%
IMG OID: 641369180
Product: ABC transporter substrate-binding protein
Protein accession: YP_001640838
Protein GI: 163852795
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID: [TIGR03407] urea ABC transporter, urea binding protein
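
The length fields in this record can be cross-checked from the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; both assumptions are consistent with the listed values but are not stated by the record itself. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length(3746599, 3747852)
print(length, protein_length(length))  # 1254 417
```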


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGACG ACAAGAAGGG CCTGGACTCG GCCCTGCGGC GCAAGCTTCT CATGGGTCTC 
GCGGGGCTTC CCGCGCTGGC GATGATGCCG CGGATGGCGT TCGCCGCCGC GCCGACCTCG
GCCGTCAACA CGACCGGGCT CGCGGTGACC GACACGGAAG TCACGGTCGG CATCCTGCAC
TCGGCCACCG GCACGATGGC GATCTCCGAG ACCGGCTCGA TCCAGGCCGA GAAGCTCGCG
ATTGCCCAGA TCAACGAGAT GGGTGGCGTG CTCGGCCGCA AGATCAAGGT GATCCAGGAG
GACGGCGCCT CCGACTGGCC GACCTTTGCC GAGAAGGCCA AGAAGCTCCT CGTCAACGAC
CATTGCGCCG CGGTGATGGG CTGCTGGACC TCCGCCTCGC GCAAAGCCGC GCTGCCGGTC
TTCGAGCAGT ATAACGGCCT GCTCTACTAC CCGACCTTCT ACGAGGGCCT GGAGCAGTCC
AAGAACGTGA TCTACACCGG CCAGGAGGCG ACGCAGCAGA TCCTTGCCTC GCTCGACTGG
GTTGCCAAGG AGAAGGGCGC CAAGTCGTTC TTCATGGTCG GCTCGGATTA CATCTGGCCG
CGCACCTCGA ACAAGATCGC CCGCAAGCAT ATCGAGAACG TGCTCAAGGG CACGGTCGCC
GGCGAGGAGT ACTTCCCCCT CGGCCACACG CAGTTCAACT CGGTCATCAA CAAGATCAAG
CTCAAGAAGC CGGACGTGAT CTTCGCCGAC GTGGTCGGTG GCTCGAACGT GGCGTTCTAC
AAGCAGCTCA AGGCGGCGGG CATCGACCTC AACAAGCAGA CCCTGCTGAC GATCTCGGTC
ACCGAGGACG AGATCGACGG CATCGGCGGC GACAACATCG CCGGCGCCTA TTCCTGCATG
AAGTACTTCC AGTCGCTGAA GAACCCGAAC AACGAGAAGT TCGTCGCCGC CTTCAAGAAG
ATGTGGGGCG ACAAGACCGT CATCGGCGAC GTGACCCAGG CTGCCTATCT CGGCCCGTTC
CTGTGGAAGA TGGCGGTGGA GAAGGCCGGC TCCTTCGATG TCGACAAGGT CGCCGCGGCG
TCCGCCGACA TCGAGTTCAA GGAGGCGCCG GAAGGCTACG TGAAGGTTCA CCCGAACCAT
CACCTCTGGT CGAAGACCCG CGTCGCCAAG GCCCTGCCGA GCGGCCAGTT CGAGGTGGTC
TACGAGAGCC CCGAGCTGAT CGAGCCGAAC CCCTTCCCGA AGGGCTACCA GTAG
 
Protein sequence
MADDKKGLDS ALRRKLLMGL AGLPALAMMP RMAFAAAPTS AVNTTGLAVT DTEVTVGILH 
SATGTMAISE TGSIQAEKLA IAQINEMGGV LGRKIKVIQE DGASDWPTFA EKAKKLLVND
HCAAVMGCWT SASRKAALPV FEQYNGLLYY PTFYEGLEQS KNVIYTGQEA TQQILASLDW
VAKEKGAKSF FMVGSDYIWP RTSNKIARKH IENVLKGTVA GEEYFPLGHT QFNSVINKIK
LKKPDVIFAD VVGGSNVAFY KQLKAAGIDL NKQTLLTISV TEDEIDGIGG DNIAGAYSCM
KYFQSLKNPN NEKFVAAFKK MWGDKTVIGD VTQAAYLGPF LWKMAVEKAG SFDVDKVAAA
SADIEFKEAP EGYVKVHPNH HLWSKTRVAK ALPSGQFEVV YESPELIEPN PFPKGYQ
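
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a listing could be cleaned, translated, and checked for GC content follows. It is an illustration under stated assumptions, not the method used to generate this record: the helper names are hypothetical, the codon table is written out by hand (translation table 11 uses the same amino acid assignments as the standard code), and alternative start codons permitted by table 11 are not handled, which does not matter here because the gene begins with ATG.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence listed above.
first_line = "ATGGCGGACG ACAAGAAGGG CCTGGACTCG GCCCTGCGGC GCAAGCTTCT CATGGGTCTC"
print(translate(clean_sequence(first_line)))  # MADDKKGLDSALRRKLLMGL
```

Passing the complete gene sequence through `clean_sequence` and `translate` allows the result to be compared against the listed protein sequence, and `gc_fraction` against the GC content field.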