Gene Mext_1148 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1148 
Symbol 
ID5835256 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp1256980 
End bp1258341 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content65% 
IMG OID641366941 
ProductABC transporter nitrate-binding protein 
Protein accessionYP_001638621 
Protein GI163850578 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.294193 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.60829 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCCGT TCGACAATTC CTTCGATGCC AAACGCCGCT TCGCACGCGG CGGCTGCTCC 
TGCGGTTCGC ACGCGTCCCA GGCCGCGCAC GAGGCGGCCC TGCCGAGCGG GGACGCCGCG
ATCGAGCGGG CCGTCGAGGC GACGATGATG CGGGCTCTGT TTCCGCATCA GGCGGAGCGC
CGCGCCTTCC TGCGCAGCGT CGGCCTCGCG ACGGCGGCGG CGGCCGTCAG CCAGTTCCTG
CCGACGAAAT TCGTGGCCGA GGCCTTCGCC GAAGCGGGCA AGCCGGAGAA GACCGACCTC
AAGATCGGCT TCATCCCGAT CACCTGCGCC ACGCCGATCA TCATGGCCAA GCCCATGGGC
TTCTATGAAA AGCAGGGCCT CAACGTCGAT GTCGTGAAGA CCGCCGGCTG GGCGGTCGTG
CGCGACAAGA CCCTGAACAA GGAATACGAC GCCGCCCACA TGCTGGCACC GATGCCGCTC
GCCATCACGA TGGGGCTCGG ATCGAACCCC GTTCCGTTCG CCGTGCCGGC GATCGAGAAC
GTCAACGGGC AGGCAATCTG CCTTGCGAAT AAGCACAAAG ACAATCGCGA CCCAAAGAAC
TGGAAAGGAT TCAAACTCGC GATTCCGTTC GATTACTCGA ACCACAACTA CTTGTTGCGC
TACTACCTCG CCGAGCACGG CATCGATCCC GACACCGACG TGCAACTGCG CTCGGTGCCG
CCGCCCGAGA TGGTCGCGAA CCTGCGCGCC GACAACATCG ACGGCTTCCT CGCGCCCGAC
AACGTCGTCC AGCGCGCGGT CTATGACGGC GTCGGCTTCA TCCACATCCT GTCGAAGGAG
ATTTGGGACG GGCATCCCTG CTGCTCCTTC TCCGTGGCGC AGGACACGAT CCGCGACATG
CCGAACGCGA CGGCCGCGAT GCTGCGCGCC ATCCTCCAGG CGACCGCCTA CGCCTCGAAG
GTGGAGAACC GCAAGGAGAT CGCCGCCGCC ATCGCGCCCG CGAACTACCT CAACCAGCCG
CTCACCGTGG TGGAGCAGGT TCTCACCGGC ACCTATGCCG ACGGGCTCGG CAGCGTGAAG
AAGGACCCCA AGAGGGTCGC CTTCGACCCC TTCCCCTACG AGAGCTTCGC GATCTGGACG
CTGACCCAGA TGAAGCGCTG GGGCCAGATC AAGGGCGATG TCGACTACGC GGCGGTCGCA
AAGCAGGTCT ACCGCGCCAC CGACGCCGCC AAGCTGATGC AGCAGGACGG ACTGACTCCG
CCGGAGGCCA CCACCAAGAC CTTCGTGGTG ATGGGCAAGA CCTTCGATCC GGCCAAGCCC
GAGGAATACC TCGACTCCTT CAAGATCAAG CGTGCGAGCT AG
 
Protein sequence
MAPFDNSFDA KRRFARGGCS CGSHASQAAH EAALPSGDAA IERAVEATMM RALFPHQAER 
RAFLRSVGLA TAAAAVSQFL PTKFVAEAFA EAGKPEKTDL KIGFIPITCA TPIIMAKPMG
FYEKQGLNVD VVKTAGWAVV RDKTLNKEYD AAHMLAPMPL AITMGLGSNP VPFAVPAIEN
VNGQAICLAN KHKDNRDPKN WKGFKLAIPF DYSNHNYLLR YYLAEHGIDP DTDVQLRSVP
PPEMVANLRA DNIDGFLAPD NVVQRAVYDG VGFIHILSKE IWDGHPCCSF SVAQDTIRDM
PNATAAMLRA ILQATAYASK VENRKEIAAA IAPANYLNQP LTVVEQVLTG TYADGLGSVK
KDPKRVAFDP FPYESFAIWT LTQMKRWGQI KGDVDYAAVA KQVYRATDAA KLMQQDGLTP
PEATTKTFVV MGKTFDPAKP EEYLDSFKIK RAS