Gene NATL1_01571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_01571 
Symbol 
ID4780513 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp152293 
End bp154011 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content32% 
IMG OID640083421 
ProductABC transporter, ATP binding component 
Protein accessionYP_001013986 
Protein GI124024870 
COG category[R] General function prediction only 
COG ID[COG0488] ATPase components of ABC transporters with duplicated ATPase domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTGCGAC TAGAGAAGAT TAGTAAAATT TATCCCACTG GCGAAGTCTT GAAAGATGTC 
AGTTGGGAAA TTAGAAATGG AGAGAGAATT GGTTTGGTTG GAGTCAATGG AGCAGGAAAA
TCAACACAAT TAAAAATTAT TGCTGGATTA GAAGAAGCAA CTGATGGATC TTTGATTAGC
GAAGGGGATC CATCTATTGC ATATTTAAAA CAGGAATTTG ATGTTGATCT TTCAAGAACT
GTTAGAGAAG AGTTATTTGC TGCATTTAAA GAAGCATCTG ATTTACTTCA CAGTCAAAAA
TTAGTTCAAG AAAAAATGGA ATCTGAATTA GCTTCTAAAG ATTTAGATTA CCTAGATTTA
TTAATCAAAG AATTAAGCGT GATTCAAAGC AAATTTGAAT CAATAAATGG TTACGATTTA
GAATCTAAAG TTGAAAAGTT ATTACCCACT ATTGGTTTCA ATCAAAATGA AGCAGACAGA
CTAGTTGGAG ACTTCTCGGG TGGCTGGCAG ATGAGAATAG CTTTAGGAAA AATCCTATTA
CAAAGTCCTG ATTTATTGTT ACTTGATGAA CCAACTAATC ATTTAGATTT AGAAACGATT
GAATGGCTAG AGAATTATTT ACTTAATCAG AAAATTGCTA TGGTAATTGT TAGCCATGAT
AGATCTTTCT TAGATAAAAT TTGTACGAGA ATTGTTAATA CCGAGCGAGG TAAATCTAAA
AGCTATCTTG GAAATTATAC GTCATATCTT CAACAGAGAG ATTTTGAATT GGAATCAACA
AAAGTTGCAT ACGAGAAACA ACAGAAGGAT ATACAAGTTC AAAAGGCATA TATAGAAAGA
TTTCGAGCAA GTGCTACAAG AAGTACACAA GCTAAAAGTA GAGAAAAGTT ATTAGATAAA
GTTGAAAAGA TAGAAGCTCC TGAGAATAAC TTAAAAGGAC CTAATTTTAA ATTTTTGGAA
GCACCACGTG CTGGTAGGGA TATCTTAAAT ATTAAGGATT TAACCCATAG CTATGAAGAT
AATATTTTAT TTTTAGGAGC CTTTTTAGAG CTTGAGCCAG GCGAAAGAAT AGCATTTTTA
GGTCCAAATG GTTCTGGGAA ATCTACTTTA TTGCGACTAA TTATGGGGTT AGAAGAACCT
GATGAAGGAT CTATTACGAT AGGAAAATAT AATATTATAC CTAGTTATTT TGAACAAAAT
CAAGCAGAGG CCTTAGAGTT AGAAAAAACA GTAATTGAGA CAATTTCTCA ATCTGTACCT
GATTGGACAC AAACAGAAAT TCGTTCTTTA CTGGGTAGCT TTGGTTTAAC TAATGATTCG
GTTTTTAAGG AGGTCAGTCA GATTAGTGGA GGAGAGAAAG CAAGACTTGC TTTAGCTTTA
ATGATTATTA AGCCGTCAAA TTTGCTTATT CTTGATGAAC CGACAAATCA TTTAGATATA
CCTTCAAAGC AAATGCTAGA GCAGGCATTA TCCAATTATA ATGGCACTGC ATTAATAGTT
TCTCATGATC GATATTTTAT TTCAAAAGTT GCAAACAAAA TTGTAGAAAT AAGAGATGGT
CAATTAATTA AGTATCAAGG TGATTACAAA TACTATAAAG AGAAAAAAAT CGAAGAATCA
CAAGAAAAAG AAAAAGAATT ACAATTAGCT GAAAGGGAAA GAAAAAGGTT GGCTAATCGA
GAAAAACAGC GTAGGAAGAA GAAAACTAAA CAAAAATAA
 
Protein sequence
MLRLEKISKI YPTGEVLKDV SWEIRNGERI GLVGVNGAGK STQLKIIAGL EEATDGSLIS 
EGDPSIAYLK QEFDVDLSRT VREELFAAFK EASDLLHSQK LVQEKMESEL ASKDLDYLDL
LIKELSVIQS KFESINGYDL ESKVEKLLPT IGFNQNEADR LVGDFSGGWQ MRIALGKILL
QSPDLLLLDE PTNHLDLETI EWLENYLLNQ KIAMVIVSHD RSFLDKICTR IVNTERGKSK
SYLGNYTSYL QQRDFELEST KVAYEKQQKD IQVQKAYIER FRASATRSTQ AKSREKLLDK
VEKIEAPENN LKGPNFKFLE APRAGRDILN IKDLTHSYED NILFLGAFLE LEPGERIAFL
GPNGSGKSTL LRLIMGLEEP DEGSITIGKY NIIPSYFEQN QAEALELEKT VIETISQSVP
DWTQTEIRSL LGSFGLTNDS VFKEVSQISG GEKARLALAL MIIKPSNLLI LDEPTNHLDI
PSKQMLEQAL SNYNGTALIV SHDRYFISKV ANKIVEIRDG QLIKYQGDYK YYKEKKIEES
QEKEKELQLA ERERKRLANR EKQRRKKKTK QK