Gene NATL1_08631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_08631 
Symbol 
ID4780845 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp793950 
End bp795752 
Gene Length1803 bp 
Protein Length600 aa 
Translation table11 
GC content29% 
IMG OID640084138 
Producthypothetical protein 
Protein accessionYP_001014686 
Protein GI124025570 
COG category[S] Function unknown 
COG ID[COG5360] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.143582 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000750178 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGAAAAGAC TATTAATAAC ATTATATGAT ATTGGCCTGA AAAGGCTTTT TCAAAGATTA 
TTATATGATT TCAAAAAATC TATAAATAAA CTTTTACCAT ATCATTTATT TAATAACTCA
AAAAGATTTA ATCCTCATTT TTTAGACTTG CATCAAGAGT TGAATACATT TAAATATTCC
CCTTCACATT TTAATAAATA TATACCAAAA AAAATAGAAT TTACTTTTAT AAATAAAAAG
AGATTACTAG ATTTCCCTCT GAAATGGAAT TCGTCTGAGT GGGGAAGATT ATGGCAATTT
AATTTACACT ATTTTGATTG GTCACGTTCT ATTCTAGAAG AATCTATTAT AAATAATAAA
TGGACCGATG AATCAATAAT ATTAGAATAT CTAATAGATA ATTGGATAAA TTCTAATCTA
CCAGGAAGAG GTGATGGCTG GAACAGTTAT ACTCTATCTC TAAGAATAAG AAATTGGATA
TGGATTTTTA GAACATGCCC TAGTTTTATT AATCAGAAAA GAGTTTATTC TCTTTGGTAT
CAAATATGTT GGTTACGAAA TCACCCTGAA GAATGTCATG GAGGCAATCA TTGGATTGAA
AATCTTATAA CTTTGGTGAT AGGTAGTCTT CAATTCAATG AAGTAAGATC TAAAGAAATC
TACAGATACT CTTTATACAA ATTAAAAAAT GAACTTGAAA ATCAAATTTT ATTAGATGGT
GGACATGAAG AAAGAAGTGC TAGTTATCAT TTATTGATCT TGGATAGATT AGTTGAATTA
GGTTGTGTTT TAGATAGTGT CAATGGATAT AGGCCTATAT GGCTACTAAA ATCGATTGAA
TCTATGAATA AATGGTTAAA AATAACTTCA ATATCTAATA AAAGGCTTCC ACAGTTTAAT
GATTACTCAA TTGACAGTAA TTTAGATCTA AATATAGTCA TATCTTTTGC AGATTCTTAT
TTAAACAAAA CTAATTATCT AAGCAAAGGT TTTAGATCAA AACTTCTATC CAATTATCCA
AAGGATACAC ATATAGAAAC CTCATGTTTA AAACCAAATA CTATTGAGAT ACCCTCAATC
GTTGATCTAA AAGAAACTGG TTGGGTATTA TTGCGTCCAG ATAAAAACTG GACACTTGCA
TTTAAATCTG GGAAAGCATG CCCAAACCAT CTTCCCGCCC ATGTTCATTC TGATATCTTA
AGTTTCGATT TATTCAAAAA TGGAATCCCA ATATTTGTAT CAGCTGGCAC TAGTGAATAT
GGAAATTCAA AAAGAAGGTT CTATGAACGT TCGGGTCAAG CACACAATAT CTTACAAATT
GGTACTAGGA AGTATGGTAA TTATAATAAA ATAAATTGGA TTGAAGGTAT AGATGTATGG
GGATCTTTTA GAGCTGGTAA GAAATCAATG CCAACTTATC GAAAAAGTAA ACAATTAAAA
AATGGAGCAC TTTATACATC AGGTATTTAT GACACATATC AAAGATATGA AGCCTTTCAT
AAACGCTCTA TACAAATGAG AATTGATAAA TCAAACAACC TTATATTTTT ATTAAAAGAT
ATAATAAAAA CGGAAAATCC CATTTTTATA AGACAATGGT GGCATCTAGG TGTTGACGCC
GATGAAACTT TGCTAGAAAA GATAGCTACT CAGCTCATTA AAAATAAGAA TTTGAAAGCA
GAATATATCA ATACATATTA CTCTTCAGAA TTGGGGAAAA AAGTAAAAAG GAGAAGTTTA
TCAATCACAG GACCAATATC AGATAAGCAT ACTGTATTAT CAGTTAAACT AAATATAAAA
TAG
 
Protein sequence
MKRLLITLYD IGLKRLFQRL LYDFKKSINK LLPYHLFNNS KRFNPHFLDL HQELNTFKYS 
PSHFNKYIPK KIEFTFINKK RLLDFPLKWN SSEWGRLWQF NLHYFDWSRS ILEESIINNK
WTDESIILEY LIDNWINSNL PGRGDGWNSY TLSLRIRNWI WIFRTCPSFI NQKRVYSLWY
QICWLRNHPE ECHGGNHWIE NLITLVIGSL QFNEVRSKEI YRYSLYKLKN ELENQILLDG
GHEERSASYH LLILDRLVEL GCVLDSVNGY RPIWLLKSIE SMNKWLKITS ISNKRLPQFN
DYSIDSNLDL NIVISFADSY LNKTNYLSKG FRSKLLSNYP KDTHIETSCL KPNTIEIPSI
VDLKETGWVL LRPDKNWTLA FKSGKACPNH LPAHVHSDIL SFDLFKNGIP IFVSAGTSEY
GNSKRRFYER SGQAHNILQI GTRKYGNYNK INWIEGIDVW GSFRAGKKSM PTYRKSKQLK
NGALYTSGIY DTYQRYEAFH KRSIQMRIDK SNNLIFLLKD IIKTENPIFI RQWWHLGVDA
DETLLEKIAT QLIKNKNLKA EYINTYYSSE LGKKVKRRSL SITGPISDKH TVLSVKLNIK