Gene NATL1_21761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_21761 
Symbol 
ID4780327 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1832634 
End bp1833878 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content37% 
IMG OID640085474 
Productmajor facilitator superfamily multidrug-efflux transporter 
Protein accessionYP_001015996 
Protein GI124026881 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.407538 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGAAGGC TAAAAATTCC CACCCTTTTA GGAGCTTTCA TCACACTTCT AGATGATCGA 
TTAGGCGAAA CCATTGTTTT ACCTTTATTA CCTTTTTTAT TAGAACAATT CACGACAAGC
GCGACGACTC TTGGTTTTTT AACTGGAACT TATGCGATAT CTCAGTTTGC TGCAGCCCCA
CTAATTGGAG CTATGAGTGA TCGTTTCGGT CGTAAGCCAA TCATGATCAC ATGTGTATCT
GGTTCAGTAA TAGGAATATG TCTATTTGCA TTAACTGTAA GCCTAAATTG GGATAATTAT
TTACCTTTAT GGGCCTCAAC TTTACCTTTA TCTTTACTAT TTTTAGCCAG AATAATTGAT
GGTATAAGTG GTGGTACCGC AGCTACTGCT ACTACAATAC TTGCAGATAT ATCAACTCCG
GAAAATCGCG CAAAAACCTT TGGATTAATT GGAGTAGCGT TTGGTTTAGG TTTTATTCTT
GGGCCAGGAT TAGGAACAGC TCTTGCTAAA TTTAGTGTTA CTTTACCGGT ATGGGTGGCC
AGCGGATTTG CAATATTTAA TCTTATTTTT GTAATTTGGT TTCTACCGGA AACACTGCCC
AAAAACAAAA GAAATTTACT ACCAAGAAAA AGAGATTTGA ATCCAATTAG TCAGCTACTA
GTTGTATTTA AAAACCCCTT AGCTAGAAGA CTTTGCTTAT CGTTCTTTGT TTTCTTTATG
GCATTTAATG GCTTTACAGC TGTTTTAGTC CTTTATTTAA AAGAAAAATT TGGATGGAGT
CCTGAATTAT GTAGTGCTGC TTTTATTGTC GTTGGAGTTA TTGCGATGAT TGTTCAAGGA
GGCCTAATTG GTCCTCTTGT AAAAAGATTT GGGGAGTCGA GATTAACTTT TGCTGGTATT
GGCTTTGTAA TGACAGGATG CATTCTTTTA ACGCTCGCCA ATATAGACAC TTCAATTCCT
CTTGTATTTT CTGGCGTCGC AATACTTGCA ATGGGAACTG GACTAGTAAC TCCTAGTTTA
AGAGCACTAA TTTCAAGAAG ACTAAGTTCT ATTGGTCAAG GAGCAGTATT GGGAAATCTG
CAAGGTTTAC AAAGTCTGGG AACTTTTCTT GGAGCAATAG CAGCAGGACG GTCATATGAT
CTTTTGGGTC CAAGAAGTCC ATTCTTTGGC ACAATATTGC TTCTACTATT TGTTATGTTT
TTAATTTCAG GGAAAAGTCT TACCAAGAAA AAAGTAATCT CCTAG
 
Protein sequence
MRRLKIPTLL GAFITLLDDR LGETIVLPLL PFLLEQFTTS ATTLGFLTGT YAISQFAAAP 
LIGAMSDRFG RKPIMITCVS GSVIGICLFA LTVSLNWDNY LPLWASTLPL SLLFLARIID
GISGGTAATA TTILADISTP ENRAKTFGLI GVAFGLGFIL GPGLGTALAK FSVTLPVWVA
SGFAIFNLIF VIWFLPETLP KNKRNLLPRK RDLNPISQLL VVFKNPLARR LCLSFFVFFM
AFNGFTAVLV LYLKEKFGWS PELCSAAFIV VGVIAMIVQG GLIGPLVKRF GESRLTFAGI
GFVMTGCILL TLANIDTSIP LVFSGVAILA MGTGLVTPSL RALISRRLSS IGQGAVLGNL
QGLQSLGTFL GAIAAGRSYD LLGPRSPFFG TILLLLFVMF LISGKSLTKK KVIS