Gene NATL1_08691 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_08691 
Symbol 
ID4779546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp804488 
End bp806257 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content26% 
IMG OID640084144 
Producthypothetical protein 
Protein accessionYP_001014692 
Protein GI124025576 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.109178 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000878377 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGGAAAAAC TAAATAAATA TAAAAAGACT ATTACATTTG AGAAATTAAT TAAAGAGAAT 
AAGATTACAC AAAAGATGAT ATCAACACTA GCAAGCAATT ACTCTAAGGA ATTGATTAGT
TCTAAAATAC TAAAGAGAAT AATGAGAAAG TCAAAGATAG AAAAGTATAA TTATATTAAA
AAAGCATTAT TAAATTATAC TGCAATAATC GTCCATGATT CATTTGGATT ATCCTTAATT
TTTGATGCCC TTATAAGAAA TGGGATGCAA AAAGCGATCT TTTATTGCAA TAGAGGTGGT
TTTCAAATAA AAGATTTGTA TGATACAAAT TATTTTTTAG ACAACAATTT ATATTACAAA
CGACATCTAC AATATTTATG TAACACAAAT GGAATAGCTT TTAAAGGTAT AAAGAAAGTT
AGCTTAGGGA ATTTTTCTAT TAAGATAAGA GATTGGACAA TATATGTTTA TCGCTTTATA
ATACTATCCT TAAGATGTAT TAAAAATAAT GAAAAGATAA AGAGGCAAAA ACTGAATGCT
ATTCATCTAA TAAGGTCTGA AGTAGAGCTT TATTCTTCAG AGCCTATTAT CAAAGAAACA
ACGGCTAGAG GAGACAACTA TATTTATATT GTAGATGATT TAATGAAGTT TCCAACATGT
ACAAAAGTAA TTAAGAGTAA AGATTATAAC TGGTTATCAA TTCATAGCTT TACCAATTTT
AAAGATATTT ATATAACTTT TATTAAAGTG ATATGGATCC TTAAAAACAT AGAGAGTTTT
AATAAAAAGA TGATACCAAA TACAAATATG AGTAAATATG GTTTCTTAGG AGAATCAAAT
CAAATCAAAT CGATATTATA CAAAATTTTA CTGAACTCAT TACCAGAAAT AATAATTCAT
GAAAAGCAAT TAAAAAAAGT ATTAAATTTA TTAAAGCCAA AATATATTGT CTCGTACGAT
CAAATAGATA AATATGGGGC AGTCCAAGGA TCTGTAGCAA AAGAAAATTC TATAGGTTCA
GTAATGATAC AAACAACAGC GATTGATGAT ATTAAATATC CATATCCCCT TAGCATGGAC
AATATGATTG TATCTTCAGA AAAAGTAAAA GATATTTTAT TGTCGTCAGG AGCCAAAAAA
AATAAAATAC ATGACTTTGG TCTTCCAAGT CTGTATGGAA TCAAGAGTAA AGGTGATAAG
AAAATAGAGG AATTATTAAA TAAGAGAGAT AATCAATTAA TTATTTTAAT AGCAACACAG
CCGTTTGTCT CTGATATAAA TTACAACGAT TTGTTAGTTA ACAATGTTAT CAATACATTG
GCAAAAAGCA CCTATAATAT AAAAATAGTG ATAAAGCCAC ATCCTCGAGA GGCAAAGCAA
AAAAATTACA TTGAAAAGCA ATCAATTCCG AAACTACATA TCGTAACTAA TTATGATAAA
TTTGAAAATT TACTTAAAAA AGCAGACATA GTTATATCAA GGACTTCAAC TGTTATTCAG
ACATCAATTA TTGGTGGTGT ACCTCCAATT TCCTATTTAG AGATGTATCC TTCTGAAATA
ATCAATAGGC TAGACTATCT TGAATCTAAA GCTACATATA AATGCTTAAC AAAAGAACAA
TTAAGTTATA TATTAAGTGA ATATATCTCA AAAGAAAGAA GAATAGATAA ATTAAAAGAA
TTCAAGAAAA ATCGGAATAG ATATATAAAT AAACAGTTTA AAGGTAATAA TAGTATCGAT
AAAACAATGA ATCTTTTAGA AAATATATAA
 
Protein sequence
MEKLNKYKKT ITFEKLIKEN KITQKMISTL ASNYSKELIS SKILKRIMRK SKIEKYNYIK 
KALLNYTAII VHDSFGLSLI FDALIRNGMQ KAIFYCNRGG FQIKDLYDTN YFLDNNLYYK
RHLQYLCNTN GIAFKGIKKV SLGNFSIKIR DWTIYVYRFI ILSLRCIKNN EKIKRQKLNA
IHLIRSEVEL YSSEPIIKET TARGDNYIYI VDDLMKFPTC TKVIKSKDYN WLSIHSFTNF
KDIYITFIKV IWILKNIESF NKKMIPNTNM SKYGFLGESN QIKSILYKIL LNSLPEIIIH
EKQLKKVLNL LKPKYIVSYD QIDKYGAVQG SVAKENSIGS VMIQTTAIDD IKYPYPLSMD
NMIVSSEKVK DILLSSGAKK NKIHDFGLPS LYGIKSKGDK KIEELLNKRD NQLIILIATQ
PFVSDINYND LLVNNVINTL AKSTYNIKIV IKPHPREAKQ KNYIEKQSIP KLHIVTNYDK
FENLLKKADI VISRTSTVIQ TSIIGGVPPI SYLEMYPSEI INRLDYLESK ATYKCLTKEQ
LSYILSEYIS KERRIDKLKE FKKNRNRYIN KQFKGNNSID KTMNLLENI