Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_1148 |
Symbol | |
ID | 5835256 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 1256980 |
End bp | 1258341 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641366941 |
Product | ABC transporter nitrate-binding protein |
Protein accession | YP_001638621 |
Protein GI | 163850578 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.294193 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.60829 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCCGT TCGACAATTC CTTCGATGCC AAACGCCGCT TCGCACGCGG CGGCTGCTCC TGCGGTTCGC ACGCGTCCCA GGCCGCGCAC GAGGCGGCCC TGCCGAGCGG GGACGCCGCG ATCGAGCGGG CCGTCGAGGC GACGATGATG CGGGCTCTGT TTCCGCATCA GGCGGAGCGC CGCGCCTTCC TGCGCAGCGT CGGCCTCGCG ACGGCGGCGG CGGCCGTCAG CCAGTTCCTG CCGACGAAAT TCGTGGCCGA GGCCTTCGCC GAAGCGGGCA AGCCGGAGAA GACCGACCTC AAGATCGGCT TCATCCCGAT CACCTGCGCC ACGCCGATCA TCATGGCCAA GCCCATGGGC TTCTATGAAA AGCAGGGCCT CAACGTCGAT GTCGTGAAGA CCGCCGGCTG GGCGGTCGTG CGCGACAAGA CCCTGAACAA GGAATACGAC GCCGCCCACA TGCTGGCACC GATGCCGCTC GCCATCACGA TGGGGCTCGG ATCGAACCCC GTTCCGTTCG CCGTGCCGGC GATCGAGAAC GTCAACGGGC AGGCAATCTG CCTTGCGAAT AAGCACAAAG ACAATCGCGA CCCAAAGAAC TGGAAAGGAT TCAAACTCGC GATTCCGTTC GATTACTCGA ACCACAACTA CTTGTTGCGC TACTACCTCG CCGAGCACGG CATCGATCCC GACACCGACG TGCAACTGCG CTCGGTGCCG CCGCCCGAGA TGGTCGCGAA CCTGCGCGCC GACAACATCG ACGGCTTCCT CGCGCCCGAC AACGTCGTCC AGCGCGCGGT CTATGACGGC GTCGGCTTCA TCCACATCCT GTCGAAGGAG ATTTGGGACG GGCATCCCTG CTGCTCCTTC TCCGTGGCGC AGGACACGAT CCGCGACATG CCGAACGCGA CGGCCGCGAT GCTGCGCGCC ATCCTCCAGG CGACCGCCTA CGCCTCGAAG GTGGAGAACC GCAAGGAGAT CGCCGCCGCC ATCGCGCCCG CGAACTACCT CAACCAGCCG CTCACCGTGG TGGAGCAGGT TCTCACCGGC ACCTATGCCG ACGGGCTCGG CAGCGTGAAG AAGGACCCCA AGAGGGTCGC CTTCGACCCC TTCCCCTACG AGAGCTTCGC GATCTGGACG CTGACCCAGA TGAAGCGCTG GGGCCAGATC AAGGGCGATG TCGACTACGC GGCGGTCGCA AAGCAGGTCT ACCGCGCCAC CGACGCCGCC AAGCTGATGC AGCAGGACGG ACTGACTCCG CCGGAGGCCA CCACCAAGAC CTTCGTGGTG ATGGGCAAGA CCTTCGATCC GGCCAAGCCC GAGGAATACC TCGACTCCTT CAAGATCAAG CGTGCGAGCT AG
|
Protein sequence | MAPFDNSFDA KRRFARGGCS CGSHASQAAH EAALPSGDAA IERAVEATMM RALFPHQAER RAFLRSVGLA TAAAAVSQFL PTKFVAEAFA EAGKPEKTDL KIGFIPITCA TPIIMAKPMG FYEKQGLNVD VVKTAGWAVV RDKTLNKEYD AAHMLAPMPL AITMGLGSNP VPFAVPAIEN VNGQAICLAN KHKDNRDPKN WKGFKLAIPF DYSNHNYLLR YYLAEHGIDP DTDVQLRSVP PPEMVANLRA DNIDGFLAPD NVVQRAVYDG VGFIHILSKE IWDGHPCCSF SVAQDTIRDM PNATAAMLRA ILQATAYASK VENRKEIAAA IAPANYLNQP LTVVEQVLTG TYADGLGSVK KDPKRVAFDP FPYESFAIWT LTQMKRWGQI KGDVDYAAVA KQVYRATDAA KLMQQDGLTP PEATTKTFVV MGKTFDPAKP EEYLDSFKIK RAS
|
| |