Gene Mext_1169 details


Gene Information

Locus tag: Mext_1169
Symbol:
ID: 5832419
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 1287933
End bp: 1289018
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 70%
IMG OID: 641366962
Product: ABC transporter substrate-binding protein
Protein accession: YP_001638642
Protein GI: 163850599
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
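
The length fields in this record are mutually consistent, which a little arithmetic can confirm. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the stop codon; the record states neither convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(1287933, 1289018)
print(length)                     # 1086
print(protein_length_aa(length))  # 361
```

Both results match the Gene Length and Protein Length fields listed for Mext_1169.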


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0939824
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.0445222
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACG AGCCCCCGCG CCCCGGCCTC CTGCCGTCGC GCCGGTTCCT GTTGCGCGCA 
GGCGCTGCCG CAGCGGCTCT CCCGCTTGGG TTCGGGGTGG GCGGCGTGCG GGCCTGGGGC
CCCGGCCCGG CCGTACCGTT CGATCCCGGC CCGATCTGCC GCCCCGCCGC CGCGGAGGGG
CCCGCCGGTC CCCTGAAGCC GATCAAGCTC GCCTGGAACG CCACCGCGAT CTGCACCGCC
GCGGCGCCGC TGGCCAAGGA GCGCGGCATC TTCGCCGCCC ACGGCCTCGA CGTGGAGTTC
GTGAATTTCG GCGGCTCGAC CGAGGCTCTG TTGGAGGCCA TTGCCACGGG CAAGGCGGAT
GCCGGCATCG GCATGGCGTT GCGCTGGCTC AAGCCTCTGG AACAGGGCTT CGACGTGAAG
ATCACCGCCG GCCTGCACGG CGGCTGTCTC CGGCTGCTCG GCGCGAAATC CGCCGGCATC
ACCGACGTCG CGGCGCTGAA GGGCAAAACG ATCGCGATCA GCGATCACGC GAGCCCGGCC
AAGAACTTCT TCGCCCTGCT GCTCGCGCAG GCCGGCATCG ATCCGGAGAC CGGCGTCGAG
TGGCGGCAAT ACCCGGCCGA CCTCCTCAAC CTTGCGGTCG AGAAGGGCGA GGCGCAGGCG
CTGGCCGATT CCGATCCGCG CACGTGGATC TGGCTGAAGG ATCCGAAATT CACGGAAGTC
GCGACCAACC TCTCGGGGGC TTACGCCGAT CGCACCTGCT GCGTGGTCGC CGTGCGCGGC
AGCCTGATCC GCAATGATCG CGCCGCCGCC GCCGCGCTCA CCCGCGCCGT GCTGGAGGCC
GGTCACCGCG TCCACGAGAA CCCGAAGGAC GCCGCACGCA TCTTTTCCGG CTACGGCGGC
AAGGGTTCGG TCGAGGATCT TGCCGCGATG CTGCGCAGCC AGCACCACGG CGACCGCCCG
GTCGGCACCG ACCTGAAACG CCAGCTCGTG CTTTACGGCG ACGAACTCAA ACAGGTGAAC
GTCCTCAAGC GCACCACCGA CACGGCTAAG TTCGCCGAGC GCGTCTATGC CGACGTGCTG
AGCTGA
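
The GC content field can be recomputed from a gene sequence laid out like the one above. This is a minimal sketch: it assumes the sequence is pasted as a string with the 10-base blocks separated by whitespace, and `gc_percent` is a hypothetical helper name, not part of any database tool.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence: 3 G + 3 C out of 10 bases.
print(gc_percent("ATGACCGACG"))  # 60.0
```

Passing the full 1086 bp sequence gives the figure to compare with the rounded GC content field.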
 
Protein sequence
MTDEPPRPGL LPSRRFLLRA GAAAAALPLG FGVGGVRAWG PGPAVPFDPG PICRPAAAEG 
PAGPLKPIKL AWNATAICTA AAPLAKERGI FAAHGLDVEF VNFGGSTEAL LEAIATGKAD
AGIGMALRWL KPLEQGFDVK ITAGLHGGCL RLLGAKSAGI TDVAALKGKT IAISDHASPA
KNFFALLLAQ AGIDPETGVE WRQYPADLLN LAVEKGEAQA LADSDPRTWI WLKDPKFTEV
ATNLSGAYAD RTCCVVAVRG SLIRNDRAAA AALTRAVLEA GHRVHENPKD AARIFSGYGG
KGSVEDLAAM LRSQHHGDRP VGTDLKRQLV LYGDELKQVN VLKRTTDTAK FAERVYADVL
S
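
The protein sequence follows from the gene sequence by codon-by-codon translation. The sketch below builds the codon table from the NCBI ordering of bases (T, C, A, G); translation table 11 uses the same amino acid assignments as the standard code and differs only in which codons may act as start codons. It does not handle alternative start codons, which is not an issue here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (shared by tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGACCGACG AGCCCCCGCG CCCCGGCCTC"))  # MTDEPPRPGL
# Last four codons of the gene sequence, ending in the TGA stop codon.
print(translate("GTGCTGAGCTGA"))  # VLS
```

The two outputs match the first ten and last three residues of the protein sequence shown above.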