Gene NATL1_08661 details


Gene Information

Locus tag: NATL1_08661
Symbol:
ID: 4779264
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 799081
End bp: 800337
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 28%
IMG OID: 640084141
Product: hypothetical protein
Protein accession: YP_001014689
Protein GI: 124025573
COG category:
COG ID:
TIGRFAM ID:
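
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 799081
end_bp = 800337
gene_length_bp = 1257
protein_length_aa = 418

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS ends in one stop codon that is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

coordinates_consistent = span_bp == gene_length_bp and span_bp % 3 == 0
protein_consistent = expected_protein_aa == protein_length_aa

print(span_bp, expected_protein_aa, coordinates_consistent, protein_consistent)
# prints: 1257 418 True True
```

Under these assumptions the reported gene length (1257 bp) and protein length (418 aa) agree with the coordinates.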


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.408365
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000490709
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGACAAAAT TATTTCTTCC TAATTATTTT TGGTTTATTT GCTTTTTACC ATTTATCCAA 
CCTTTTCCAT TCCCTTCTGA TTTACAACCA TTATGTCCAT TGATAGGGCT AATTATTTTA
ATAAGGTCTT CATTCAAAAT AAGCAAATTA GTAACTCCTA TCTTATTTTT GTTTTTAATA
GCAGTATTTA CTTGGATAAA CCCATTTTTA GGTAACTCAT TTCTTCTTGA AAAAACATAT
TTACTTAAAA TAGCTTCAAT TCCAGCCGCT TTAATAATTT ATAACTGTAC AAACAATCTT
ATAGGTTATC TAAGACCAAA ACATGTCTAT GCAACTACTT TTATTTACTT TTTAGCATTT
GTTTTTAGAA ACATATCCCC CTTTTGGTTT GTGACTATAC AGGATTATTT TGTTAACAAA
ACAAATGTAA CTTTAAATTC ATTAGCTGAG TATTTAGTAC GCTCAAATAG AGATATTGGA
ATATTATCTA CTGAACCTGC ATTTACAGCT GCTTGTTGTG CAACCTTAAT TGTTACAGCA
TTATGGTTTT TACAATCATC ATCTAAATAC GAAATAGAAG GGAATTACAT AAATAGATTG
GAAGCAAAAC TTCTTACGAT TAGTATATTA ATTAATATAC TATTAATTAT AGGAACAAAA
TCATTAAGTG GTTATGTTTA TTTATTTCTA ATCTTTCTGC CAAAGATAGG ATCATTAGTC
ATCAATAATA TTAGGCTTTT ATTAAATTTA ACAAAAAACA AAATTAGAGT ATCACGTAAT
AATCTTTTGA TAATTTCAAT TTTTCCAATT TTATTAGCCA CAGTATTATA TTTTATAAAT
TTTAATACCT ATAATTTGAA TATTAATTCT AGGTTGATTC AGGGACTTTC AGTATTATTT
AATACCCCAG AAAAAATTTA CCAAATAGAT GGAGGAAGGG TCGAAGCGAT AATTACAAAT
ACTAAAATTT TCCTAAGTCA ACCAATAACT GGCTATGGCT TCGAATTCCC AGATTTATAT
AAGATTTATC AAGCTTCAGG TGAATATGTT ACAAATAAAG GATCTATTTC AACAATTACA
TACTTTCCAG CTGCAACGGG ATTCCTATCA TTAGTAGCGT TTTATTTACT AATAAAACAG
TCACGATCTC CAATATATGC TAAATTCTTG TCATTGTGTT TTTTAACGGT ATCTTTTTCA
CTAGCTTTCC CTCCTATATG GGTTTTACTT TCTTTAAGAC CAGACAGGAA GAAATAA
 
Protein sequence
MTKLFLPNYF WFICFLPFIQ PFPFPSDLQP LCPLIGLIIL IRSSFKISKL VTPILFLFLI 
AVFTWINPFL GNSFLLEKTY LLKIASIPAA LIIYNCTNNL IGYLRPKHVY ATTFIYFLAF
VFRNISPFWF VTIQDYFVNK TNVTLNSLAE YLVRSNRDIG ILSTEPAFTA ACCATLIVTA
LWFLQSSSKY EIEGNYINRL EAKLLTISIL INILLIIGTK SLSGYVYLFL IFLPKIGSLV
INNIRLLLNL TKNKIRVSRN NLLIISIFPI LLATVLYFIN FNTYNLNINS RLIQGLSVLF
NTPEKIYQID GGRVEAIITN TKIFLSQPIT GYGFEFPDLY KIYQASGEYV TNKGSISTIT
YFPAATGFLS LVAFYLLIKQ SRSPIYAKFL SLCFLTVSFS LAFPPIWVLL SLRPDRKK
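
The gene and protein sequences above can be compared by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method: it assumes translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons, which this sketch ignores), and the helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGACAAAAT TATTTCTTCC TAATTATTTT TGGTTTATTT GCTTTTTACC ATTTATCCAA"
print(translate(first_line))  # prints: MTKLFLPNYFWFICFLPFIQ
```

The printed residues match the first 20 residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` would let the reported protein length and GC content be checked the same way.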