Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_4497 |
Symbol | |
ID | 5832018 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 5016120 |
End bp | 5017136 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641370290 |
Product | putative nitrate transport protein |
Protein accession | YP_001641936 |
Protein GI | 163853893 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGATCA GACTCGGATA CGTCCCCCTC ACCGACGCCG CCCCCGTGAT CGCGGCGGCG GAACTGGGCT TCGCACGCGC AGAGGGGCTC GAGATCGAAC TCGCGCGCGA GCCCTCCTGG GCGACCCTGC GCGACCGGCT GGCGCTCGGC CATCTCGACG CCGCGCACAT GCTCGGGCCG CTCGCCATCG CCAGCGCGCT CGGGCTCTCG GGGCCGCAGG CCCGTCTGAG CGTGCCGATG GCGCTCGGCC TCAACGGCAA CGCCGTGACC GTCTCGAACG CGCTCTGGGC GGCGATGGCG CCGGAGAGCG ACGGACTCGG TGACGTGGCC GCGGCTTTCT CGGCGGTCGC CCGCGCGCGG GCCGGGGAGG GGCGTCCGCT CGTCATCGGC ACGGTGCATC CCTTCTCCAG CCATTCCTAC CAGCTCCGCC TGTTCGCCGG CCTGAGCGGG CTCGACCTCG ACGCGACGGT GCGGTTGGTC GTGGTGCCGC CGCCGGAGAC GGTGGATGCG CTCCGGCGCG GGCGGATCGA CGGATTCTGC GTCGGCGCCC CCTGGAACAG CGTCGCGGTC GCCGCCGGCC TCGGCCGGAT CGCGGCACTC GGCTGCGAGA TCGCGCCCGA CTGCCCGGAG AAGGTGCTGG CGCTGCCCGC GGAGGGGGCC GACTTCACGG CACCTTTGGT CAGGGCCGTC CATCGGGCCG GACTTTGGTG CGCCGCCCCC GAGAACCACG AGGCCCTGAG CCGCATGCTC GCCGAACGGG CAGAACTCGA CGCGGATGCC GCGCTTCTGG CGCGCACGCT CAGCGGCGCG CTGATCGTGG ATCGGGACGG AACCGAGCGG GCGAACCCGG ACTATCTGCG CCTCGACGCG GCGACCCACC GGCCGGACCC GGAGCATGCC CGGTGGCTGG TGGCGCAGAT GGCCGCCTGC GGGCAGGTGG CGTCCGGCGA CGACGCGGCG GACCGGGCGG CAGCGCTCTA CCGGCCCGAC CTCTTCGCCG CGGCCATCGG CGGCTGA
|
Protein sequence | MRIRLGYVPL TDAAPVIAAA ELGFARAEGL EIELAREPSW ATLRDRLALG HLDAAHMLGP LAIASALGLS GPQARLSVPM ALGLNGNAVT VSNALWAAMA PESDGLGDVA AAFSAVARAR AGEGRPLVIG TVHPFSSHSY QLRLFAGLSG LDLDATVRLV VVPPPETVDA LRRGRIDGFC VGAPWNSVAV AAGLGRIAAL GCEIAPDCPE KVLALPAEGA DFTAPLVRAV HRAGLWCAAP ENHEALSRML AERAELDADA ALLARTLSGA LIVDRDGTER ANPDYLRLDA ATHRPDPEHA RWLVAQMAAC GQVASGDDAA DRAAALYRPD LFAAAIGG
|
| |