Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_21831 |
Symbol | |
ID | 4780284 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1843102 |
End bp | 1844421 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640085481 |
Product | hypothetical protein |
Protein accession | YP_001016003 |
Protein GI | 124026888 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.285408 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTAAAA ATGTTTATTT CCTAAATTTA AAAAATAATT TTTCAGACAA TGTTGGCTGG ACTTGTTTTC AGTTAGGTGT ATTTTGTTTG CCGTCCAGCG CGTTAATTTC TTGTGTATTT TTGCTTGTAG CTCTTTTTGA AGGAGGTCTT AAAAGAAGAG ATCTCTATTG GAGGGAACAT TGGAATTATC CGCTAGTGCT CGTAACTTTT TTGATGATAA TTGGTTCAAT TCGTTCTCAT ACCGGCTGGC TAGCTTGGCT TGGTCTTTTT AATTGGTTGC CATTCTTCTT GTGTTTTTGG GGATTGCAAC CATATTTGCT AACTCCTGAA AGAAGAAAAA AATGTGCTTC TTGGCTTGTC TTCGGAAGTC TTCCTGTTTT AATAACTGGC TTTGGTCAGT TATGGTTTGG GTGGGAGGGG CCTTGGCAGG TTTTTGATGG TTTAATAATT TGGTTTATTT CTCCAGGAGG AGAGCCGCTA GGCAGATTGT CTGGTTTGTT TGATTACGCC AATATTGCTG CTGCATGGTT ATCGGGTGTA TGGCCATTCT GTTTGGCGTC TGTTTTGCAT CCTTTTATTC TTGGAAGAAA CCGTGCTATT CCATTTGCTC TTTTAATTGC TTTTATTTCA GGAATGATTT TGACTGACTC TAGAAATGCA TGGGGTGCAA TTTTTTTAGC CTTACCATTA GTTTTTGGTT CAGCATCATG GAGTTGGTTA ATTCCTTTAA TGCTTATTTG TTCTCTTCCG GTGATTATTG CGGTTTTACC ATTTTTTGAC TTTGGGATTC AACAATTTGC AAGAAGTATT GTTCCTGAAT CTATTTGGAT GAGACTAAAT GATATGCAAT TCGTTGATAC TAGACCTTTT GAAGCAACGC GCATTGGTCA ATGGAAAATA GCATTTAACT TAATTTTTGA GAAACCATGG TTTGGTTGGG GGGCAGCAGC TTTTTCAATT ATTTATCCTT TAAGGACAGG CTTATCTCAT GGACACTCTC ATAATTTGCC TTTAGAATTA GCAATTAGTC ATGGTATAAT AGTCTCTTTT TTAATTAATA TATTTGTGTT TAGCTTATTA TTAATTTCTT CCTTCAAGAG AATATTTAAC AATTTAAATC TTCAACAAAA TCTCATAGTA GATCGAGCTT GGTGGACCTC TACTTTAATT CTTATTTGTT TTCATGCGAC AGATATTCCA CTTTTTGATA GCAGAATTAA TATCTTAGGT TGGATATTAT TGATAGGTCT TAGATGTATG ATCTATAATT CTACGAGCTA CAATAATTCA TTAAAACAAT TTAATAATGT ATTAAATTAA
|
Protein sequence | MIKNVYFLNL KNNFSDNVGW TCFQLGVFCL PSSALISCVF LLVALFEGGL KRRDLYWREH WNYPLVLVTF LMIIGSIRSH TGWLAWLGLF NWLPFFLCFW GLQPYLLTPE RRKKCASWLV FGSLPVLITG FGQLWFGWEG PWQVFDGLII WFISPGGEPL GRLSGLFDYA NIAAAWLSGV WPFCLASVLH PFILGRNRAI PFALLIAFIS GMILTDSRNA WGAIFLALPL VFGSASWSWL IPLMLICSLP VIIAVLPFFD FGIQQFARSI VPESIWMRLN DMQFVDTRPF EATRIGQWKI AFNLIFEKPW FGWGAAAFSI IYPLRTGLSH GHSHNLPLEL AISHGIIVSF LINIFVFSLL LISSFKRIFN NLNLQQNLIV DRAWWTSTLI LICFHATDIP LFDSRINILG WILLIGLRCM IYNSTSYNNS LKQFNNVLN
|
| |