Gene A9601_19051 details


Gene Information

Locus tag: A9601_19051
Symbol:
ID: 4718644
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 1641064
End bp: 1642335
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 31%
IMG OID: 640079640
Product: major facilitator superfamily multidrug-efflux transporter
Protein accession: YP_001010295
Protein GI: 123969437
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
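
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the helper names `gene_length_bp` and `protein_length_aa` are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1641064, 1642335)
print(length)                     # 1272
print(protein_length_aa(length))  # 423
```

Both values agree with the Gene Length and Protein Length fields of this record.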


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAAGAAA GTTTATTAAA ACCAAATAAA AAATTTACTC TCCTTAGTGC CTTTATCACT 
CTTCTAAATG ATCGTTTAAG TGAAAGCATA TTACTACCTA TATTACCCTC CTTTGTTTTA
CTTTTTGATT CTAAAGCAAG TACATATGGT TTATTATCAT GCACTTACCA ATTAGCTCAA
TTTACAGCTT CTCCTTTTAT AGGACTTATG AGCGATAGAT ATGGAAGAAG ACCTGTCACT
CTTTTTTGTA TTACTGGTTC AGTCATAGGA ATATCAATAT TATCTTTTAC GGTTCTATTT
AACTGGTCAA ATTCAATAGC CTCTATCCCT TTATTTTTAT TATTTTTAGC AAGACTAATT
GACGGTTTAA GTGGGGGAAC TGCAGCTACT GCAACAACAA TTCTTGCAGA TATTTCAAGC
CCTGAAAAAA GAGCAAAAAC ATTTGGACTT ATTGGTGTAG CTTTTGGTTT AAGTTTTTTC
TTAGGTAATA TTTTTGTTGT TATTTTTGCC AAAAATACAA ATAATAATTT TATTATTCCA
GTTTTGATAG CCTCAATCAT TCCAATAATA AATTTCCTCC TTGTATTTTT TTACTTACCG
GAAACCAAGC CTAATAGTGA CTCAAATAAA TCAAAACCTT TTATAAGAAA CCCTTTAAAA
AACCTATTTA CAGTTTTCAA AGAAGAAAAG ATTAAAAAAT TATCATTAGC TTTTTTTATT
TACTTTATTG CCTTTACTGG ATTGACCAAT ATACTTATAT TCTTCCTTCA AGAATCTTTA
AACTGGACGA CAAAAGCATC AAGTGGAACT CTTGTTGTAG TAGGAATAAT TGCAATTATC
GTTCAGGGAG GACTAATTGG GCCTCTTGTA AAACAATTTG GAGAAATGCG ATTAACACTT
ATCGGATCAG GCTTCATTCT TGTTGCATGT GCTCTTTTAA TAACTGCTCC AAAAGAAAAT
GCGACAATTA ATATTTATTC AGCTGTATCA TTTTTAGCCG TTGGGGCAGG ATTAATTACG
CCCACCTTAA GAGCACTAAT ATCAAAGAAA TTAGACATTG ATAAACAAGG ATCAATTTTA
AGTAATCTTC AAGGTCTACA GAGTCTTGGG GGGGTTTTAG GAATTGCAAT GGCAGGAAGG
GTTTATGATA GTTTTGGTCC TAAATCTCCT TTTATAGCTG GTTCCGTTAT CTTGCTTTTC
ATGATATATC TTATTGCAGA GGGTAAAAGT AATAATTCTT TTAATAATCA AAAATCAAAA
GTATTGAAAT GA
 
Protein sequence
MKESLLKPNK KFTLLSAFIT LLNDRLSESI LLPILPSFVL LFDSKASTYG LLSCTYQLAQ 
FTASPFIGLM SDRYGRRPVT LFCITGSVIG ISILSFTVLF NWSNSIASIP LFLLFLARLI
DGLSGGTAAT ATTILADISS PEKRAKTFGL IGVAFGLSFF LGNIFVVIFA KNTNNNFIIP
VLIASIIPII NFLLVFFYLP ETKPNSDSNK SKPFIRNPLK NLFTVFKEEK IKKLSLAFFI
YFIAFTGLTN ILIFFLQESL NWTTKASSGT LVVVGIIAII VQGGLIGPLV KQFGEMRLTL
IGSGFILVAC ALLITAPKEN ATINIYSAVS FLAVGAGLIT PTLRALISKK LDIDKQGSIL
SNLQGLQSLG GVLGIAMAGR VYDSFGPKSP FIAGSVILLF MIYLIAEGKS NNSFNNQKSK
VLK
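
The protein sequence follows from the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below is a minimal, dependency-free illustration, not the pipeline used to produce this record. The function name `translate_cds` is hypothetical, and it assumes the input is a complete coding sequence read in frame from its first base. Table 11 uses the standard amino acid assignments but accepts alternative start codons such as GTG, which are translated as methionine at the first position.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G, with the third position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS with table 11, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks used for display
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"                    # an alternative start codon still encodes Met
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence listed above.
print(translate_cds("GTGAAAGAAA GTTTATTAAA ACCAAATAAA"))  # MKESLLKPNK
```

The output matches the first ten residues of the protein sequence. The initial GTG codon is read as M rather than V because it is the start codon.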