Gene NATL1_18141 details


Gene Information

Locus tag: NATL1_18141
Symbol:
ID: 4780893
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1478820
End bp: 1480745
Gene Length: 1926 bp
Protein Length: 641 aa
Translation table: 11
GC content: 36%
IMG OID: 640085103
Product: hypothetical protein
Protein accession: YP_001015634
Protein GI: 124026519
COG category: [R] General function prediction only
COG ID: [COG0488] ATPase components of ABC transporters with duplicated ATPase domains
TIGRFAM ID:
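
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are illustrative only.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1478820, 1480745)
print(length)                  # 1926
print(protein_length(length))  # 641
```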


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0376333
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.70207
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTGATTA GTCTTGTTAA TGCTTCAACA GACTTTGGGA TAAAAAATCT TTTTAAAAAT 
TTAGATCTTC ATGTAAATAA AAAAGAGAGA CTTGGTTTGA TTGGTCCAAA TGGATCTGGC
AAGTCAACAC TTTTGAGAGT CATTGCAGGA ATTGAACCTT TGATGGAAGG AGAAAGGAGA
TGTTTATCAT CTTTGCGGAT ATCTTTAGTT GGGCAAGAAA CAAGTTACAA CAGTGAAAAA
AGTATTTTGG AAGAAGTCCT TGAAGGGTGT GGAGAAAAAA GAAAATTATT ACTTAATTTC
AGTCAGCTAA GTAGAAAAAT CGCTCAAAAT CCAGAAGATG AAGACCTTTT GAAAAAACTT
GGTCAGGCGA GTGAACTTAT GGATGCCGCT GGGGCATGGA ATTTAGAACA ACAATGCCAA
GATGTTTTAA GAAGATTAGG TATAAAAGAC TTAGATAAGC CAGTAAAAGA GCTTTCTGGT
GGTTATCGCA AAAGAGTGGG ACTTGCCGCT GCGCTTGTCT CTAAACCAGA CGTCTTACTT
CTTGATGAAC CTACTAACCA CCTAGATGCA TCTGCAGTGG AATGGCTTCA AAATTGGTTA
GACCATTATG AGGGTGCGCT GGTCTTGATA ACTCACGATA GATATGTTCT TGATCGCATT
ACCAATCGGA TGGTCGAAAT TAACAATGGA GAAACTCGCA AGTATTCGGG CAATTATCGT
GAATTTCTTC AACAAAAAGT TGAACAAGAG CAATCAGAGG CATCTACACA GAAAAAGTTT
CAGGGTGTTT TAAGAAAGGA ATTAGCTTGG TTAAGACAGG GCCCCAAAGC AAGAAGTACA
AAACAAAAAG CACGTATTCA ACGGATTGCT GAAATGCAAG CGAAACCTAA AAGTCATGTC
AAAGCTAATT TAGAAATGAA TTCATTGAGT AGAAGAATTG GAAAAATCGC AATTGAGGCT
GAAGGTGTAG GGCTATCTCT CAACAATAAA GAGAATAATT TGGATCTTTT ATGTGATTTT
ACTTATAGCT TTAGTCCAGA GGACCGAGTA GGAATTATTG GTCCAAATGG GAGTGGTAAA
TCCACTCTTT TAGATCTAAT TTCGGGTAAA AGATTGCCTA CAAGTGGGAA AATAAAACTT
GGAGAAACGG TTCATATTGG CTACTTAGAT CAACATACAA ATGACTTAAA TCAAGGGAGT
GGCTTAAACC GCAAAGTTAT CGATTTTGTG GAGGAGGCTG CATTACGAAT TGATCATGGA
GGGAAACAAA TTACAGCATC ACAACTCTTA GAAAAATTCC TCTTTCCACC CAGTCAACAA
CATAGTCCTC TGCTAAAACT TTCAGGGGGA GAAAAAAGAA GACTTGCTCT ATGCAAAATG
CTCATACAAG CTCCCAACGT ATTATTGCTT GATGAGCCTA CAAATGATTT AGATATACAA
ACACTAAGTG TGCTAGAAGA TTTTCTTGAT GATTTTAAAG GTTGTGTCGT AGTCGTATCG
CATGATAGAT ATTTTCTTGA TCGCACTATT GATCGAATTT TTAATTTTGA AAACGGTCAC
TTGCGAAGGT ATGAAGGAAA TTACTCTCGA TTTCTTGATC ATAAAATATT AGAGGAGCGA
AACAATGAAA CCAAAAAACG AGCCAAAATA GTCAATAATT CACAAAATAA GCGTGGACAA
GAAATAAAAT TAGATTCTAA AAATGATTCG AGACGATTGA GTTTTAAAGA AGCTAGAGAA
TTAAAAGAAT TAGATGTGAG ACTACCTATG TTAGAAAAAA AGAAAATATG TTTAGAAAAA
AAGATTACTG ATAGTGATGT AGACATTAGT GAAATTAGTC ATCAATTAGC AGAACTAATT
GAATCTATTC AAGAGCATGA GGATAGATGG ATTGAGCTAA GTGAGTTGTC TGAGTCAGCA
AAGTAA
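
The GC content field reported for this gene (36%) can in principle be recomputed from the sequence block above. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over all bases; the record does not say how its figure was derived or rounded. The function name `gc_content` is illustrative, and the sequence is expected to be pasted in as a string, with the spaces and line breaks of the 10-base layout ignored.

```python
def gc_content(sequence):
    """Percentage of G and C bases; whitespace in the input is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Small self-check on a toy sequence laid out in blocks like the record.
print(gc_content("GGCC AATT"))  # 50.0
```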
 
Protein sequence
MLISLVNAST DFGIKNLFKN LDLHVNKKER LGLIGPNGSG KSTLLRVIAG IEPLMEGERR 
CLSSLRISLV GQETSYNSEK SILEEVLEGC GEKRKLLLNF SQLSRKIAQN PEDEDLLKKL
GQASELMDAA GAWNLEQQCQ DVLRRLGIKD LDKPVKELSG GYRKRVGLAA ALVSKPDVLL
LDEPTNHLDA SAVEWLQNWL DHYEGALVLI THDRYVLDRI TNRMVEINNG ETRKYSGNYR
EFLQQKVEQE QSEASTQKKF QGVLRKELAW LRQGPKARST KQKARIQRIA EMQAKPKSHV
KANLEMNSLS RRIGKIAIEA EGVGLSLNNK ENNLDLLCDF TYSFSPEDRV GIIGPNGSGK
STLLDLISGK RLPTSGKIKL GETVHIGYLD QHTNDLNQGS GLNRKVIDFV EEAALRIDHG
GKQITASQLL EKFLFPPSQQ HSPLLKLSGG EKRRLALCKM LIQAPNVLLL DEPTNDLDIQ
TLSVLEDFLD DFKGCVVVVS HDRYFLDRTI DRIFNFENGH LRRYEGNYSR FLDHKILEER
NNETKKRAKI VNNSQNKRGQ EIKLDSKNDS RRLSFKEARE LKELDVRLPM LEKKKICLEK
KITDSDVDIS EISHQLAELI ESIQEHEDRW IELSELSESA K
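
The relationship between the gene and protein sequences can be illustrated with a short translation sketch. It assumes NCBI translation table 11 (the table named in this record), whose amino acid assignments match the standard code, and it treats an alternative start codon such as GTG as methionine when it appears in the first position. The helper names are hypothetical, and only the first 30 bases of the gene are translated here.

```python
from itertools import product

# Amino acids for codons in NCBI order (bases T, C, A, G at each position).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(sequence):
    """Translate a CDS, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        codon = bases[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("GTGTTGATTA GTCTTGTTAA TGCTTCAACA"))  # MLISLVNAST
```

The output matches the first ten residues of the protein sequence listed above.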