Gene NATL1_21011 details

Gene Information

Locus tag: NATL1_21011
Symbol:
ID: 4780367
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1743537
End bp: 1746329
Gene Length: 2793 bp
Protein Length: 930 aa
Translation table: 11
GC content: 33%
IMG OID: 640085397
Product: ATPase
Protein accession: YP_001015921
Protein GI: 124026806
COG category: [V] Defense mechanisms
COG ID: [COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain
TIGRFAM ID: [TIGR01846] type I secretion system ABC transporter, HlyB family
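
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the protein length excludes the stop codon. Both are assumptions, although they agree with the listed values. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1743537
end_bp = 1746329

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and is not counted in the protein.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 2793 0 930
```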


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.625289
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCTTGAAG GCAACGCGAG GCTAATTACA GAAAATAATA ATCAAAGTAC TACTATATGC 
AAATTTTCGC CTGGTTCATT TATTGGTATT AGTTCACTCT TAAGAGGCGA AGGTTATGAA
GAGGTTTCTG CAATTGAAAA TAGTCAAGCA CTTGCAATAC CAGCTAAAGA AATTGTTCAA
CTTTACTTAA ACGAAAAGAC CTTTCAAGAT TTTTGTAATT CTTATTTTGA ACCTTCTGAA
GCATTATCTA TATGCGAAAA ATTCATTAAA AATTCTCCTC GTTCTGATAT AAATTTAAAA
TCAGCTTGCT ATTCTATTTT TAAAAAGTGC AAACTCTTTA CTATCAATCA TAGTAATGAA
ATCATAAGAA TTAAAGATAA AAAATATATT TTAACTAGTG CAAATGTAGA AAATAAAAAA
ATATCAGAGA CAATCAATGA AGGGGAAAAT TTAAAGATAA GAGAACCTTT TCCAGCAAGA
GTTTTGGAAA TAGATATAGA TTTATATGAA AGTTTTTATT CCTTAACTAA TCCCGCTGTT
CAAGAAAAAA AGAATTCAAG CAAACCTAAA AATACTTTTA TTGAAGAACT TGAAGCACCA
GATAATCCAG AAATAAGTAA TATAAATGTT GGACAAATTG ATAACAATAA AAAATTCGAA
ATTATTCGGG CTTCAGGATA CGTCCAAGAG GTATTAGCCT GTATGAAAAT GTTATGTCGT
GAACTAAAGA TTCCATTCAG GAAAGATACT ATAGAAAAAA TACTTAGAAG TGCTTTAAGT
GGAGGGAAAG AACCAACTAT GGAGCTTTGC GGGGCGATTT CCTCAATGCT TGGTCTTAAT
GCAACTGGAC TAAGACTGGA TGCTTCAGCG GGAGTAAGAC TAAGGGTCCC TAGCCTAATA
AAAATAGAAA ATTCATTTGG ATTAATCAAA GAAAGTAATT CAAATGGATT AGTAATTGCA
TCACCAAGCA AAGGTCTTCT GAAAATTAAA TCAAGTGAAT TATCGTCTTA TTTTGAAGAT
GGTATAGATG TTATTTTACT CGATAAGAAA AATACAACTC CAGATCAAAA TTTTGGTATC
GGATGGTTCA TACCTGCTTT AAAAAAACAT AAAAGAGTTC TCATACAAGT ACTTATTGCA
TCTTTTGTAG TGCAACTTTT TACACTTGCG AATCCTTTAC TTATTCAAGT AATAATCGAC
AAAGTAATCA GTCAAAGAAG TCTTGCTACT TTAGAAGTGC TTGGAATTGC TTTATTTGTC
GTAACCATCC TTGGAGGCGT TATTGGTAGT TTACGAACAT TTTTATTTAC CGAGACAACA
AACAGAATTG ATACAACTTT AGGCGCAGAA GTTATTGATC ATTTACTAAG ATTGCCTCTG
AATTATTTTG ATAAAAGACC TGTAGGGGAA CTTGGCACAC GAGTATCTGA GTTAGAGAAA
ATTAGAAATT TTTTGACTGG TCAAGCTCTT ACAACAATAC TTGATGCAAT TTTTTCTGTT
ATTTATATAA TTGTTATGGT TATGTATAGT TGGCTTCTAA CATTCATAGC GCTAGCTGTT
GTTCCCATTC AAATAGCTTT AACTTTAATA GGAGCTCCAT TAATTAGAAG ACAAATTAGA
GAATTAGCTG AAGAGAATGC TAAAACTCAA AGCCATCTTG TTGAAGTGTT AACAGGTGTG
CAAACCGTCA AAGCTCAAAA TGTAGAAATT GTTAGCAGAT GGAAATGGCA GGATTTCTAT
CACAAATATA TTCAAAAAAC TTTTGAAAGA ACAATAACAG GTACTTCTTT AAGCGAAATA
AGCAAAGTTC TACAAAATCT TTCACAGCTT TTGGTTCTCT GGGTTGGAGC AAGTCTTGTA
TTAAAAGGCC AACTCACACT TGGACAATTA ATTGCATTCC GAATAATTTC TTCATATGTA
ACCCAACCAC TTTTAAGGCT GAGCAATATT TGGCAAGAAA TTCAGCAACT TAGAGTTTCT
TTTGAAAGAT TGGCCGACAT TATTGATACT CCAGAAGAAT CGAATGAATT TGACAAAGCC
AATATTCCAA TGCCTAATAT TAAAGGTAAA GTTTCTTTTG AAAATGTTAC CTTTTCATTT
GATTCAGGAA AAGAGAATGT CATTAAGAAC ATATCTCAAG AAATCCCATC AGGTAAATTT
GTCGGAATTG TAGGTCAAAG TGGAAGCGGT AAAAGTACAT TAATGAAATT ACTTCCTAGG
CTTTATGAAT TAAAAAAAGG AAAAATTCTA ATTGATGGAT ACGATATAAG CAAAACTGAG
CTTTACTCCC TTAGAAGACA AATAGGAATA GTTCCACAAG AGCCACTGCT CTTTTCGGGT
AGTGTTAGCG AAAATATTGC TTTAACGGAT CCAAATGCAG ATAGCGATGA TATTGTTAAT
GCTGCCAAGA TTGCAGATGC TCATAATTTC ATAATGACAC TTCCTATGGG GTATAGCACT
AATGTGGGAG AGAGAGGTGC AGCCTTATCG GGAGGTCAAA AACAAAGAAT CGCTATTGCT
AGGACAATAT TAAACAAACC TCGTTTATTA ATAATGGATG AAGCGACAAG TGCACTTGAC
TACAGCACAG AAAGACGAGT TTGTGAAAAT CTTAAAGAAT ATTGTTCTGG GTCGACTGTG
TTTTTTATAA CTCATAGATT AACTACAATT AAAAACGCCG ATATGATTCT TATGTTAGAT
AAGGGAATAA TTGCAGAAAC AGGAAATCAT AATGAATTAA TGTCAAAAAA AGGTAGATAT
TATGCACTTT ATCAACAACA AGGTGAAAGC TAA
 
Protein sequence
MLEGNARLIT ENNNQSTTIC KFSPGSFIGI SSLLRGEGYE EVSAIENSQA LAIPAKEIVQ 
LYLNEKTFQD FCNSYFEPSE ALSICEKFIK NSPRSDINLK SACYSIFKKC KLFTINHSNE
IIRIKDKKYI LTSANVENKK ISETINEGEN LKIREPFPAR VLEIDIDLYE SFYSLTNPAV
QEKKNSSKPK NTFIEELEAP DNPEISNINV GQIDNNKKFE IIRASGYVQE VLACMKMLCR
ELKIPFRKDT IEKILRSALS GGKEPTMELC GAISSMLGLN ATGLRLDASA GVRLRVPSLI
KIENSFGLIK ESNSNGLVIA SPSKGLLKIK SSELSSYFED GIDVILLDKK NTTPDQNFGI
GWFIPALKKH KRVLIQVLIA SFVVQLFTLA NPLLIQVIID KVISQRSLAT LEVLGIALFV
VTILGGVIGS LRTFLFTETT NRIDTTLGAE VIDHLLRLPL NYFDKRPVGE LGTRVSELEK
IRNFLTGQAL TTILDAIFSV IYIIVMVMYS WLLTFIALAV VPIQIALTLI GAPLIRRQIR
ELAEENAKTQ SHLVEVLTGV QTVKAQNVEI VSRWKWQDFY HKYIQKTFER TITGTSLSEI
SKVLQNLSQL LVLWVGASLV LKGQLTLGQL IAFRIISSYV TQPLLRLSNI WQEIQQLRVS
FERLADIIDT PEESNEFDKA NIPMPNIKGK VSFENVTFSF DSGKENVIKN ISQEIPSGKF
VGIVGQSGSG KSTLMKLLPR LYELKKGKIL IDGYDISKTE LYSLRRQIGI VPQEPLLFSG
SVSENIALTD PNADSDDIVN AAKIADAHNF IMTLPMGYST NVGERGAALS GGQKQRIAIA
RTILNKPRLL IMDEATSALD YSTERRVCEN LKEYCSGSTV FFITHRLTTI KNADMILMLD
KGIIAETGNH NELMSKKGRY YALYQQQGES
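
The gene sequence begins with TTG while the protein begins with M. This matches translation table 11, which accepts TTG as an alternative start codon. The sketch below illustrates that relationship and one way to compute GC content from the space-grouped sequence text. The names `translate_table11` and `gc_percent` are hypothetical helpers, not part of any database tool. The codon table and start-codon set are written out by hand from NCBI genetic code 11.

```python
import itertools

# NCBI genetic code 11: amino acids in TCAG codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO)
}
# Codons that table 11 accepts as initiators; an initiator is read as M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Drop the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate_table11(dna):
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
fragment = "TTGCTTGAAG GCAACGCGAG GCTAATTACA"
print(translate_table11(fragment))  # MLEGNARLIT
```

Passing the full gene sequence to the same functions would let the result be compared with the listed protein sequence and the reported GC content.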