Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_3450 |
Symbol | |
ID | 6129874 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 3834944 |
End bp | 3835975 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641643617 |
Product | nitrate/sulfonate/bicarbonate ABC transporter periplasmic ligand-binding protein |
Protein accession | YP_001770269 |
Protein GI | 170741614 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.302208 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000774086 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATCAGAC GCACCCTGCT CTCGCGGCGG CGGGCGGCCG CCCTGCTCGG CGGCGCCCTC CTGGCCGGTG CGGCCGCGCC CGGCCCCGCC GCCGCGGCCG AGGGCCGGCT GCGCATCGCC AAGCAGTTCG GCGTCGTCTA CCTGCTCCTC GACGTGGCCC TGGAGCAGCG GCTGATCGAG AAGCACGGCC GGGCCGCCGG GCTCGACATC GCGGTCGAGC CGGTGCAGCT CTCGGGCGGC GCGGCGGTCA ACGACGCGCT GCTGTCCGGC AGCATCGACA TCGCCGGGGC CGGGGTCGGC CCGCTCTTCA CCCTGTGGGA CCGCACCCGG GGCCGGCAGA ACGTCAAGGG CGTCGCCTCG CTCGGCAACT TCCCCTACCT GCTCGTCAGC AACCGGCCGC AGGTGCGGTC GATCGCCGAC CTGACCGAGG CGGACCGGAT CGCGCTGCCC GCGGTCGGCG TGTCGGTGCA GGCGCGGATC CTGCAATGGG CCGCCGCCAA GCAATGGGGC GAGGCGGATT TCGCCCGGCT CGACCGGATC AGCGTCGCGG TCCCGCATCC CGAGGCGGCG GCGGCGATCA TCAAGGGCGG CACCGAGATC AGCGCCCATT TCGGCAACCC GCCCTTCCAG GAGCAGGAAC TGGCCGAGGC CCCGGACGCC CGGGTGATCC TCAATTCCTA CGAGGTCCAG GGCGGCCCCG CCTCCTCGAC GGTGCTGTAC GCGACGGAGA CGTTCTACCG CGACAGCCCC AGGACCTACC GGGCCTTCCT CGACGCCCTC GACGAGGCGG CGACCTTCGT GGCCGCCAAC CCGGACCAGG CCGCCGAGAT CTACCTGAAG GCCAACGGCA GCCGGATCAG CCGCGATCTC CTGCTCAAGG TGATCAGGAA CCCGGACGTG ACCTTCAAGA TCGCGCCGCA GAACACGCTC GGCCTCGGCC GGTTCATGCA CCGCGTGGGC GCGATCCGCA ACGAGCCGAA GGCGCTCGCG GATTACTTCT TCGCCGATCC GCGCGTGGCC GCGGGCAGCT GA
|
Protein sequence | MIRRTLLSRR RAAALLGGAL LAGAAAPGPA AAAEGRLRIA KQFGVVYLLL DVALEQRLIE KHGRAAGLDI AVEPVQLSGG AAVNDALLSG SIDIAGAGVG PLFTLWDRTR GRQNVKGVAS LGNFPYLLVS NRPQVRSIAD LTEADRIALP AVGVSVQARI LQWAAAKQWG EADFARLDRI SVAVPHPEAA AAIIKGGTEI SAHFGNPPFQ EQELAEAPDA RVILNSYEVQ GGPASSTVLY ATETFYRDSP RTYRAFLDAL DEAATFVAAN PDQAAEIYLK ANGSRISRDL LLKVIRNPDV TFKIAPQNTL GLGRFMHRVG AIRNEPKALA DYFFADPRVA AGS
|
| |