Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_2418 |
Symbol | |
ID | 7093970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 2636795 |
End bp | 2638033 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643465740 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002362710 |
Protein GI | 217978563 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.269173 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGACA GGACGCCCGA CGCAATCGCG GCCCCAGCGA CGCCCCTGCT GCGCCAGCGC AGCGTGCAGC TGTTCGTCGG CGCGCGCATG GCGATGGCCT TCTCCATGCA GATGCAGCAT GTCGCCATCG CCTGGTCCGT CTATGCGCTG ACCGGCGATC CGTTGAGCCT CGGCCTGATC GGCCTTGTCG AATTTCTTCC GTCCGTGCTG TTCATCCTCG TGACCGGCGG CGTCGCCGAC CGGCTCGATC GCCGCAAGGT TGGCGTCGTC AGCGCTGCGG CGCAGGCCTT CGCGTCGGTT TGTCTCCTTT ATGCGGCGGC CAGCGCCTCG CCGAACCTCG CGCTCATTTA CGCCATCGTA TTTTTTCTCG GGACGGCGCG CTCCTTCGCC CAGCCGGCGC TCGCGGCGCT GCTGCCCGGT ATCGTCGAAG TTGCGGATTT TCCCCGCGCG GTGGCGATGA CGACCTCCTC GCTGCAGGTC GCAATGATCG CCGGCCCGGC GGCGGGCGGT CTTCTGATGG CCTGGAGCGC TCCGGCGCTG TTTGCGCTGG CGGCGGCGCT CAACGCCGCG GCGGCGCTTA TGATTGCCTT CATCAAAGCG CGGCAGGGCC GCGGCGAAGC GGAGAAAGAA ACCGGCTTCA AAAGCCTGCT CGCCGGCTTT TCCTATGTTC GCGCCAACCG CCTCCTGCTC GGCGTCATCT CGCTCGATCT GTTCGCCGTG CTGTTTGGCG GCGTCACCGC CCTCTTGCCG ATTTACGCCC GCGACATTCT CGACGCCGGC CCGGTCGGGC TCGGATTGCT GCGCAGCGGC CCGGCGATGG GCGCGATTGT CGTCGGCCTC ATCCTCGCCC GCTTCCCGCT GCGCCGGAAC GTCGGCGCGA TCATGCTCGC ATCGGTCGCC GCCTTTGGCG CGGCGACGAT CCTTTTTGCG CTTTCGACGA AGATGCCGCT TTCCGTCGCC GCAATGATCG CCCTTGGCGG CTTCGACATG GTCAGCATGG TCATCCGCTT CACCATGGTG CAGCTCGGCA CGCCGGATGC GATGCGCGGG CGGGTCAACG CCATCAATTA TGTCTTCATC GGCGCCTCGA ACCAGCTCGG CGACTTCGAA TCGAGCGTCG TCGCCTCTTT CGCGGGGGCA AAGGGCGCGG CCGTGATCGG CGGCGTCGGA ACGCTTCTCG TCGTCGCGCT CTGGGCCTGG CGATTCCCCG AGCTGCGCCG CGCCGACCGG CTTGAATGA
|
Protein sequence | MDDRTPDAIA APATPLLRQR SVQLFVGARM AMAFSMQMQH VAIAWSVYAL TGDPLSLGLI GLVEFLPSVL FILVTGGVAD RLDRRKVGVV SAAAQAFASV CLLYAAASAS PNLALIYAIV FFLGTARSFA QPALAALLPG IVEVADFPRA VAMTTSSLQV AMIAGPAAGG LLMAWSAPAL FALAAALNAA AALMIAFIKA RQGRGEAEKE TGFKSLLAGF SYVRANRLLL GVISLDLFAV LFGGVTALLP IYARDILDAG PVGLGLLRSG PAMGAIVVGL ILARFPLRRN VGAIMLASVA AFGAATILFA LSTKMPLSVA AMIALGGFDM VSMVIRFTMV QLGTPDAMRG RVNAINYVFI GASNQLGDFE SSVVASFAGA KGAAVIGGVG TLLVVALWAW RFPELRRADR LE
|
| |