Gene Information |
Locus tag | Nmul_A2228 |
Symbol | |
ID | 3784929 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2528869 |
End bp | 2530254 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637812316 |
Product | major facilitator transporter |
Protein accession | YP_412912 |
Protein GI | 82703346 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
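
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
# Values are copied from the record above; the helper names are hypothetical.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2528869, 2530254)
print(length)                     # 1386
print(protein_length_aa(length))  # 461
```

Both results agree with the Gene Length (1386 bp) and Protein Length (461 aa) rows.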
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.568651 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCCCT CATCGTCTTC CGAGAAACTG TCACCCCCAG AGATACGCGC GATCGCGGGC CTGGCGAGTA TTTACGGCCT CCGCATGCTG GGAATGTTCA TCATTCTCCC GGTATTCGCG TTTTACGCCG AACATCTGCC GGGAGGCGAC AATTACACGC TGGTCGGGAT TGCACTGGGC GCGTACGGGT TGACGCAGGC GATATTGCAG GTTCCGTTCG GCTGGCTGTC TGATCGCTTC GGCCGCAAAC CGGTGATTTA TGGCGGCCTG CTCCTGTTCG CCCTGGGCAG TTTTGTCGCA GCCGCAGCAA CGGATATTTA CTGGGTAATC GCTGGCCGTG TCATCCAGGG AGCAGGGGCG ATTTCGGCGG CTGTGATGGC GCTTGCCGCG GATCTCACAC GCGAAGAACA CCGCACCAAG GCAATGGCGG CGATCGGCAT GACGATAGGA ACTACCTTCG CGCTGTCTCT GGTCATTGCG CCGTCGCTCA ACCGCATGAT TGGCGTGCCC GGAATTTTCT TCATGACAGG GGTGCTGGTG CTGCTGGCGA TGATCGTAGT TTCGCGGGTG GTTCCCAACC CGACAGACAG GCGGTTCCAT TCGGATACGG AAGCGTCCGC AGGGGGAATT TTTAATGTCC TCCGAAATCC GGAACTGTTG CGCCTCGATT TTGGTGTCTT CGCATTGCAC GCCGTACTGA TGGCCCTATG GCTGGTGGTA CCGCTATCGC TGCGGCAGGC AGGGCTGGCA GCGGATCATC ACTGGCAAAT CTATTTTCCC GCGCTGGTCA TTTCCATGCT GCTGATCATT CCAGTGATCA TCTATAGCGA AAAGAAGGCA AAGCTAAAGC AAGTGTTTGT CATATCCGTC GCTGTGCTGC TGGTAAGCCA GATCCTGCTG GCCTATACAT TCGATTCGAT ATGGGGCACT GCGGGTGCGC TGCTGGTATT CTTCACCGCC TTCAATCTGC TGGAAGCGAC GCTGCCTTCG CTCATTTCCA AGATCGCTCC CGTAGGGGCA AAAGGCACCG CCATCGGGGT CTATAGCAGT GTCCAGTTCC TGGGTACGTT TATTGGTGCC AGCGCCGGGG GCTATCTCTA TCAGCATTAC GGAAGTACCG CACTGTTTGC ATTCTGCGGG GCGCTTCTCA TGTTGTGGCT GATATTTGCC GTTACCATGA AGGCGCCCGC AGCTGTTCGT ACCAGGATGT ACCACGTGCA GGTAATGGAT ACCGGCACCG CCCACGGGCT TTCGCGGCAA CTGGCGGCGC TGCCCGGTGT GCATGAGGCG CTGGTGCTTG CGAGCGAAGG GGTGGCTTAC CTGAAAGTAG ATATGCGTGG TTTTGATGAG CAGGGCGTTG CTCAATTACT TGGAGGGGAA GCATAA
|
Protein sequence | MSPSSSSEKL SPPEIRAIAG LASIYGLRML GMFIILPVFA FYAEHLPGGD NYTLVGIALG AYGLTQAILQ VPFGWLSDRF GRKPVIYGGL LLFALGSFVA AAATDIYWVI AGRVIQGAGA ISAAVMALAA DLTREEHRTK AMAAIGMTIG TTFALSLVIA PSLNRMIGVP GIFFMTGVLV LLAMIVVSRV VPNPTDRRFH SDTEASAGGI FNVLRNPELL RLDFGVFALH AVLMALWLVV PLSLRQAGLA ADHHWQIYFP ALVISMLLII PVIIYSEKKA KLKQVFVISV AVLLVSQILL AYTFDSIWGT AGALLVFFTA FNLLEATLPS LISKIAPVGA KGTAIGVYSS VQFLGTFIGA SAGGYLYQHY GSTALFAFCG ALLMLWLIFA VTMKAPAAVR TRMYHVQVMD TGTAHGLSRQ LAALPGVHEA LVLASEGVAY LKVDMRGFDE QGVAQLLGGE A
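
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch, assuming plain A/C/G/T input; the helper names are hypothetical. The example uses only the first three blocks of the gene sequence, so its GC value is not the whole-gene figure of 57% given in the record.

```python
def clean_sequence(grouped: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGTCCCCCT CATCGTCTTC CGAGAAACTG"
seq = clean_sequence(first_blocks)
print(len(seq))                   # 30
print(round(gc_percent(seq), 1))  # 53.3
```

Applied to the full gene sequence, the cleaned string should have the listed length of 1386 bp if the record is internally consistent.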