Gene Nmul_A2228 details


Gene Information

Locus tag: Nmul_A2228
Symbol:
ID: 3784929
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2528869
End bp: 2530254
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 57%
IMG OID: 637812316
Product: major facilitator transporter
Protein accession: YP_412912
Protein GI: 82703346
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
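
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are hypothetical and are not part of the database.

```python
# Values copied from the Start bp and End bp fields of this record.
START_BP = 2528869
END_BP = 2530254


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints "1386 461"
```

Under those assumptions the result agrees with the Gene Length (1386 bp) and Protein Length (461 aa) fields.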


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.568651
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCCCCT CATCGTCTTC CGAGAAACTG TCACCCCCAG AGATACGCGC GATCGCGGGC 
CTGGCGAGTA TTTACGGCCT CCGCATGCTG GGAATGTTCA TCATTCTCCC GGTATTCGCG
TTTTACGCCG AACATCTGCC GGGAGGCGAC AATTACACGC TGGTCGGGAT TGCACTGGGC
GCGTACGGGT TGACGCAGGC GATATTGCAG GTTCCGTTCG GCTGGCTGTC TGATCGCTTC
GGCCGCAAAC CGGTGATTTA TGGCGGCCTG CTCCTGTTCG CCCTGGGCAG TTTTGTCGCA
GCCGCAGCAA CGGATATTTA CTGGGTAATC GCTGGCCGTG TCATCCAGGG AGCAGGGGCG
ATTTCGGCGG CTGTGATGGC GCTTGCCGCG GATCTCACAC GCGAAGAACA CCGCACCAAG
GCAATGGCGG CGATCGGCAT GACGATAGGA ACTACCTTCG CGCTGTCTCT GGTCATTGCG
CCGTCGCTCA ACCGCATGAT TGGCGTGCCC GGAATTTTCT TCATGACAGG GGTGCTGGTG
CTGCTGGCGA TGATCGTAGT TTCGCGGGTG GTTCCCAACC CGACAGACAG GCGGTTCCAT
TCGGATACGG AAGCGTCCGC AGGGGGAATT TTTAATGTCC TCCGAAATCC GGAACTGTTG
CGCCTCGATT TTGGTGTCTT CGCATTGCAC GCCGTACTGA TGGCCCTATG GCTGGTGGTA
CCGCTATCGC TGCGGCAGGC AGGGCTGGCA GCGGATCATC ACTGGCAAAT CTATTTTCCC
GCGCTGGTCA TTTCCATGCT GCTGATCATT CCAGTGATCA TCTATAGCGA AAAGAAGGCA
AAGCTAAAGC AAGTGTTTGT CATATCCGTC GCTGTGCTGC TGGTAAGCCA GATCCTGCTG
GCCTATACAT TCGATTCGAT ATGGGGCACT GCGGGTGCGC TGCTGGTATT CTTCACCGCC
TTCAATCTGC TGGAAGCGAC GCTGCCTTCG CTCATTTCCA AGATCGCTCC CGTAGGGGCA
AAAGGCACCG CCATCGGGGT CTATAGCAGT GTCCAGTTCC TGGGTACGTT TATTGGTGCC
AGCGCCGGGG GCTATCTCTA TCAGCATTAC GGAAGTACCG CACTGTTTGC ATTCTGCGGG
GCGCTTCTCA TGTTGTGGCT GATATTTGCC GTTACCATGA AGGCGCCCGC AGCTGTTCGT
ACCAGGATGT ACCACGTGCA GGTAATGGAT ACCGGCACCG CCCACGGGCT TTCGCGGCAA
CTGGCGGCGC TGCCCGGTGT GCATGAGGCG CTGGTGCTTG CGAGCGAAGG GGTGGCTTAC
CTGAAAGTAG ATATGCGTGG TTTTGATGAG CAGGGCGTTG CTCAATTACT TGGAGGGGAA
GCATAA
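
A GC content figure like the one reported in this record can be recomputed from the gene sequence. This is a minimal sketch, assuming the sequence is pasted as text in the space-separated 10-base blocks shown here. The `gc_content` helper is hypothetical, and the way the database rounds its percentage is not stated in the record.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Usage: the first two 10-base blocks of the gene sequence.
first_blocks = "ATGTCCCCCT CATCGTCTTC"
print(f"{gc_content(first_blocks):.0%}")
```

Passing the full 1386 bp sequence instead of the two sample blocks gives the whole-gene value.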
 
Protein sequence
MSPSSSSEKL SPPEIRAIAG LASIYGLRML GMFIILPVFA FYAEHLPGGD NYTLVGIALG 
AYGLTQAILQ VPFGWLSDRF GRKPVIYGGL LLFALGSFVA AAATDIYWVI AGRVIQGAGA
ISAAVMALAA DLTREEHRTK AMAAIGMTIG TTFALSLVIA PSLNRMIGVP GIFFMTGVLV
LLAMIVVSRV VPNPTDRRFH SDTEASAGGI FNVLRNPELL RLDFGVFALH AVLMALWLVV
PLSLRQAGLA ADHHWQIYFP ALVISMLLII PVIIYSEKKA KLKQVFVISV AVLLVSQILL
AYTFDSIWGT AGALLVFFTA FNLLEATLPS LISKIAPVGA KGTAIGVYSS VQFLGTFIGA
SAGGYLYQHY GSTALFAFCG ALLMLWLIFA VTMKAPAAVR TRMYHVQVMD TGTAHGLSRQ
LAALPGVHEA LVLASEGVAY LKVDMRGFDE QGVAQLLGGE A
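
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a hedged sketch rather than the database's own method: it uses the standard codon assignments, which translation table 11 shares for every codon, and it does not handle the alternative start codons of table 11 (not needed here, since the gene starts with ATG). All names in the code are illustrative.

```python
from itertools import product

# Codon assignments in TCAG order; identical for the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()  # drop the spaces between blocks
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Usage: the first 30 bases of the gene sequence.
print(translate("ATGTCCCCCT CATCGTCTTC CGAGAAACTG"))  # prints "MSPSSSSEKL"
```

The output matches the first ten residues of the protein sequence listed in this record.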