Gene NATL1_01101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_01101 
Symbol 
ID4780843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp106831 
End bp108666 
Gene Length1836 bp 
Protein Length611 aa 
Translation table11 
GC content34% 
IMG OID640083373 
Producthypothetical protein 
Protein accessionYP_001013939 
Protein GI124024823 
COG category[V] Defense mechanisms 
COG ID[COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGGCA CTAGCAAAAC ATATCATTTA TTAATGCGCC TACTGAAGGC ACTCCCAGTT 
AGAAGAAGAA GTTCTCTGTT GAAATTAATT CCTGTAGCAG CCTTTACTGG TTTAGTCGAT
GTGATTGTAG TTGGAATTGT TTCTAGATTA TTTACTGTTT TTATAGGTCA ACCTAATCAA
CCGCCCTTAC CATTTCAACA TTTTATACCT GAAGACCCAA AAACAAAAGT CATCAGTCTA
GTCGTCATAT ACATAGCAAT GAATTGGCTA GCATCATTTT CGAAATTATT TCTTAAAGCA
GCACAAGAAA GACTTCGAGT AGCAGTTTGG AAGGACTTAT CAGAACTAGC TCAGAAAAAA
TTATTATCTC AATCATATGA ATTCTTCTTA AACAAGAAAA AATCTGATTT ATCATCAAAA
GTTTTAATCA ACATTTCTAG AGTCTCAGAA TTCCTAGTCA AACCAATCCT TCAGATTTCT
AGTGGGTTAT GTGTGATTAC TTTTATATGT ATTGCTGTTC TTTTTATAGC AAAATCGATT
GCTCTTTATT TAATAATAAG TCTCTTAATA TTTTATATAT TTATTTCATC TTTTGTAACC
CCTTTTATAA GATATTCTTC TCGTAAAAGA ATCAAATTAG AAAAAGAAAC AAATAATATA
CTCACAGAAT CAATGCGCAC CATTATAGAT GTTCACCTTA CTAGCTCAGA ACCATACTTT
GAAAAACGCT ATAAATTAGC AAGCAAAAGC TCTTTCCCTT TCATGTGGAA AGCTGAGGTT
TTACCTGAAT TACCTAGGTC ATTAATAGAG CCATTTGGTA TCACTTTAAT TTTTGCTATT
GGTCTTTTCC CATACATCAC TGGGGAAAAC GATTCAATAC TTATTGAGAT AGTTCCATTC
TTAGCAACAA TTGCAGTAGC TGCATTAAAA CTAACTCCGC CATTACAAGA TTCATTTAGA
GCATTAACTT CAATGCGAGC ATCAATACCT GATTTAGAGG AGATACTAAA GTTGATAGAA
CTTCCTTCTA CTAGGCTAAC TAAAAGATCC ATAGGCGTTC CGACAAAAGA AGGAATTCAG
CCTAGAAACA ACATAAAGCT TGAGAAATTG AGTTATAAGT ATCCCAACAG CAACGAATAC
ACCTTAAAGG GTATCAACCT TACTATTCCT ATTGGTTCAC GAATAGCTTT TGTAGGAGAA
ACTGGAAGTG GAAAAACCAC TACCGCTAAT CAATTACTAT GTCTTCTTAG ACCAACAGAC
GGACATTTAC TATTGGATGG AGTTGCAGTT ACTGATACAG AAGTGCCTGC TTGGCAAGAT
TGTTGCTCTT ACGTTCCCCA ATCAATCACC TTATTAAATA GCAATATTAT TCAAAATATT
GCATATGGTT TAGATGAAAA AATAATTGAT CATGGAAGGG TCTGGGATGC GCTTAGAGCA
GCTCAATTAG CAGATTTGGT ATCAGAAATG CCAATGGGTT TACATTCCTC AGTTGGTGAT
AATGGCATCA GATTATCTGG TGGACAAAGA CAGCGACTAG CCATAGCAAG AGCTTTTTAT
AGGCAATCAA AATTATTAGT TTTAGATGAA GCAACTAGTG CCTTAGATAA CCGAACAGAA
GCTGAGGTAA TGAATGCAAT AGAAATAATA GGTAGACGTT GCACAATAGT CACAATTGCT
CACAGATTAT CTACAATCGA AAGATCAGAT TGTATATATG AATTTAAAGA TGGAGAAATA
GTTTCCTTTG GAAATTACCA ACAATTACTA AAGCAATCTA AAACTTTTTT TAATATGGTA
GAAATAGCAA AAAGAACATA CGGATCTAAT ATATAA
 
Protein sequence
MIGTSKTYHL LMRLLKALPV RRRSSLLKLI PVAAFTGLVD VIVVGIVSRL FTVFIGQPNQ 
PPLPFQHFIP EDPKTKVISL VVIYIAMNWL ASFSKLFLKA AQERLRVAVW KDLSELAQKK
LLSQSYEFFL NKKKSDLSSK VLINISRVSE FLVKPILQIS SGLCVITFIC IAVLFIAKSI
ALYLIISLLI FYIFISSFVT PFIRYSSRKR IKLEKETNNI LTESMRTIID VHLTSSEPYF
EKRYKLASKS SFPFMWKAEV LPELPRSLIE PFGITLIFAI GLFPYITGEN DSILIEIVPF
LATIAVAALK LTPPLQDSFR ALTSMRASIP DLEEILKLIE LPSTRLTKRS IGVPTKEGIQ
PRNNIKLEKL SYKYPNSNEY TLKGINLTIP IGSRIAFVGE TGSGKTTTAN QLLCLLRPTD
GHLLLDGVAV TDTEVPAWQD CCSYVPQSIT LLNSNIIQNI AYGLDEKIID HGRVWDALRA
AQLADLVSEM PMGLHSSVGD NGIRLSGGQR QRLAIARAFY RQSKLLVLDE ATSALDNRTE
AEVMNAIEII GRRCTIVTIA HRLSTIERSD CIYEFKDGEI VSFGNYQQLL KQSKTFFNMV
EIAKRTYGSN I