Gene NATL1_10661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_10661 
Symbol 
ID4779174 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp984409 
End bp986145 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content32% 
IMG OID640084345 
ProductABC transporter 
Protein accessionYP_001014889 
Protein GI124025773 
COG category[V] Defense mechanisms 
COG ID[COG1132] ABC-type multidrug transport system, ATPase and permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.480699 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0347251 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACAAA AAAACAAAAT AAATTTTATT TACCTGATAG GGAAACTAAA GGGACATTTG 
CCAACATTGC TTTTGGGTGG CATAAGCATG TTTATATACG TAATTTGTTG GCCAATACTT
GCATGGTTAT CAGGCAAGTT AATACCTGCT ATTGGTCAAG GCAATACAAA ACAAGTATTG
ATTGTAATTT TACAAGCCTT AATTATCTTT ATCATTCAAA AAACAGCACA ATATCTACAA
GATAGTCTTT TAGCAAAACC AGCATTAGCT CTAAGCCAAG ACTTACGAAC AACACTATTC
AGAAAACTTC AAAAAACTAA TATTCTTTTT ATCGAAAAAC TTTCATCTGG GGATATTGCA
TACAGACTTA CAGAAGATGT TGATCGAGTT GGGGAGGTTA TCTATAAATC GATTCAAGAT
ACAACACCAT CGATATTTCA ATTATTGGCA GTATTTGGAT ATATGATATT TATTGATTGG
AATTTATCAC TTGCAACGAT CATATTAGCG CCTTTAATTG CTTTATTAGT AAGTGATTTT
GGAGGGAAAG TATTAAAAGC ATCTGAACAA AGTCAAAACA AAATTAGTTC ATTAGCAGGT
TTGCTATCAG AGGCTGTTCA AGGTCTACCA ATGGTTAAAG CATTTGCTGT AGAAGCATGG
CTTCAAGATG ATTTTGATAA ACAAGTTAAA TTACACAAAG AGGCTAAATA TAAAATGCTA
AGGCTTGTAG CCCTTCAGCA TCCAATAGTG GGATTAATAG AAATAATAGG TATTTTAGGG
ATCCTCACAA TAGGAACTTA TAGAATACAA ACAGGAGGTA TGTCTAACGA AGAATTTGCT
AGTTATTTTA CAGCCTTAAT AATGTTAATT GATCCAATAA GCCATATTAC TACCAACTAC
AACGAATTAA AGCAAGGACA AGCTTCACTA AGAAGATTAA ATGAAATAAC AAATAAACCA
ATAGAAATAT CAAGCTCAGA CAGAGGCATT ACTCCTGATA GAATAGATGG AAAGTTATCA
TTTAAGGATG TATTTTTCTC ATATAATGAT GATAAAGAGG TTTTAAAAAA AATTAATTTA
GAGATAGATA ATGGAAAAAT AACGGCATTA GTTGGACCTT CAGGAGCAGG AAAAAGTACA
ATCTTTTCTT TAATTTTAAA ATTCATAGAA CCTTCTAATG GATCAATATT TATTGATAAT
TATAATTTAA ACAAATTAAA TACCAATTAC TTAAGAAGAT TAATAGGTAT AGTTCCGCAA
AAAACATTTA TATTTTCAGG GACCATATCA GAAGCAATAA GATTTGGAAG GCCAACCACT
AAAGCAAACA TAGTTAATGC AGCAAGAATT GCAAATGCTC ATGATTTTAT TGAAGAATTA
CCCGACGGCT ATGAAACTTT TATAGAAGAG AGGGGTAGCA ATCTTTCTGG AGGTCAGTTA
CAGAGAATAT CAATAGCAAG AGCTTTATTA GGAGATCCAA CAATTTTATT GCTAGATGAA
GCTACTAGTG CCCTAGATGC TGAATCGGAG GAATCTGTTC AGAAGGGTCT TCAACAGGCT
ATGCACAATC GAACAGTCTT GGTAATTGCT CACAGATTAT CAACAATACA AAAGGCAGAC
AAAATAGCTG TGATTGAAAA AGGCGAAGTA CTTGAAGTAG GTAGCCATAA AGAATTAATT
AATAGGCAAG GAAGATATAA GGATTTTTGC GATAAACAAA TAATCAAGAG CTATTAA
 
Protein sequence
MKQKNKINFI YLIGKLKGHL PTLLLGGISM FIYVICWPIL AWLSGKLIPA IGQGNTKQVL 
IVILQALIIF IIQKTAQYLQ DSLLAKPALA LSQDLRTTLF RKLQKTNILF IEKLSSGDIA
YRLTEDVDRV GEVIYKSIQD TTPSIFQLLA VFGYMIFIDW NLSLATIILA PLIALLVSDF
GGKVLKASEQ SQNKISSLAG LLSEAVQGLP MVKAFAVEAW LQDDFDKQVK LHKEAKYKML
RLVALQHPIV GLIEIIGILG ILTIGTYRIQ TGGMSNEEFA SYFTALIMLI DPISHITTNY
NELKQGQASL RRLNEITNKP IEISSSDRGI TPDRIDGKLS FKDVFFSYND DKEVLKKINL
EIDNGKITAL VGPSGAGKST IFSLILKFIE PSNGSIFIDN YNLNKLNTNY LRRLIGIVPQ
KTFIFSGTIS EAIRFGRPTT KANIVNAARI ANAHDFIEEL PDGYETFIEE RGSNLSGGQL
QRISIARALL GDPTILLLDE ATSALDAESE ESVQKGLQQA MHNRTVLVIA HRLSTIQKAD
KIAVIEKGEV LEVGSHKELI NRQGRYKDFC DKQIIKSY