Gene Information |
Locus tag | NATL1_08661 |
Symbol | |
ID | 4779264 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 799081 |
End bp | 800337 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 640084141 |
Product | hypothetical protein |
Protein accession | YP_001014689 |
Protein GI | 124025573 |
COG category | |
COG ID | |
TIGRFAM ID | |
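The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS includes its terminal stop codon. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the record above.
START_BP = 799081
END_BP = 800337
GENE_LENGTH_BP = 1257
PROTEIN_LENGTH_AA = 418


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming a complete CDS that ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1257
    print(expected_protein_length(GENE_LENGTH_BP))  # 418
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields listed in the record.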
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.408365 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000000490709 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGACAAAAT TATTTCTTCC TAATTATTTT TGGTTTATTT GCTTTTTACC ATTTATCCAA CCTTTTCCAT TCCCTTCTGA TTTACAACCA TTATGTCCAT TGATAGGGCT AATTATTTTA ATAAGGTCTT CATTCAAAAT AAGCAAATTA GTAACTCCTA TCTTATTTTT GTTTTTAATA GCAGTATTTA CTTGGATAAA CCCATTTTTA GGTAACTCAT TTCTTCTTGA AAAAACATAT TTACTTAAAA TAGCTTCAAT TCCAGCCGCT TTAATAATTT ATAACTGTAC AAACAATCTT ATAGGTTATC TAAGACCAAA ACATGTCTAT GCAACTACTT TTATTTACTT TTTAGCATTT GTTTTTAGAA ACATATCCCC CTTTTGGTTT GTGACTATAC AGGATTATTT TGTTAACAAA ACAAATGTAA CTTTAAATTC ATTAGCTGAG TATTTAGTAC GCTCAAATAG AGATATTGGA ATATTATCTA CTGAACCTGC ATTTACAGCT GCTTGTTGTG CAACCTTAAT TGTTACAGCA TTATGGTTTT TACAATCATC ATCTAAATAC GAAATAGAAG GGAATTACAT AAATAGATTG GAAGCAAAAC TTCTTACGAT TAGTATATTA ATTAATATAC TATTAATTAT AGGAACAAAA TCATTAAGTG GTTATGTTTA TTTATTTCTA ATCTTTCTGC CAAAGATAGG ATCATTAGTC ATCAATAATA TTAGGCTTTT ATTAAATTTA ACAAAAAACA AAATTAGAGT ATCACGTAAT AATCTTTTGA TAATTTCAAT TTTTCCAATT TTATTAGCCA CAGTATTATA TTTTATAAAT TTTAATACCT ATAATTTGAA TATTAATTCT AGGTTGATTC AGGGACTTTC AGTATTATTT AATACCCCAG AAAAAATTTA CCAAATAGAT GGAGGAAGGG TCGAAGCGAT AATTACAAAT ACTAAAATTT TCCTAAGTCA ACCAATAACT GGCTATGGCT TCGAATTCCC AGATTTATAT AAGATTTATC AAGCTTCAGG TGAATATGTT ACAAATAAAG GATCTATTTC AACAATTACA TACTTTCCAG CTGCAACGGG ATTCCTATCA TTAGTAGCGT TTTATTTACT AATAAAACAG TCACGATCTC CAATATATGC TAAATTCTTG TCATTGTGTT TTTTAACGGT ATCTTTTTCA CTAGCTTTCC CTCCTATATG GGTTTTACTT TCTTTAAGAC CAGACAGGAA GAAATAA
|
Protein sequence | MTKLFLPNYF WFICFLPFIQ PFPFPSDLQP LCPLIGLIIL IRSSFKISKL VTPILFLFLI AVFTWINPFL GNSFLLEKTY LLKIASIPAA LIIYNCTNNL IGYLRPKHVY ATTFIYFLAF VFRNISPFWF VTIQDYFVNK TNVTLNSLAE YLVRSNRDIG ILSTEPAFTA ACCATLIVTA LWFLQSSSKY EIEGNYINRL EAKLLTISIL INILLIIGTK SLSGYVYLFL IFLPKIGSLV INNIRLLLNL TKNKIRVSRN NLLIISIFPI LLATVLYFIN FNTYNLNINS RLIQGLSVLF NTPEKIYQID GGRVEAIITN TKIFLSQPIT GYGFEFPDLY KIYQASGEYV TNKGSISTIT YFPAATGFLS LVAFYLLIKQ SRSPIYAKFL SLCFLTVSFS LAFPPIWVLL SLRPDRKK
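The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. It is an illustration only: the helper names are hypothetical, and the database may compute and round its reported value differently.

```python
def join_blocks(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


if __name__ == "__main__":
    # First two blocks of the gene sequence in this record, used as a small demo.
    demo = join_blocks("ATGACAAAAT TATTTCTTCC")
    print(len(demo))         # 20
    print(gc_percent(demo))  # 25.0
```

Passing the full Gene sequence field through `join_blocks` and then `gc_percent` gives a value to compare with the reported GC content.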