Gene Information |
Locus tag | NATL1_05441 |
Symbol | |
ID | 4780428 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 491718 |
End bp | 492944 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 640083821 |
Product | hypothetical protein |
Protein accession | YP_001014371 |
Protein GI | 124025255 |
COG category | [S] Function unknown |
COG ID | [COG1641] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00299] conserved hypothetical protein TIGR00299 |
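
The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. The sketch below is only illustrative: the function name `cds_lengths` is hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the reported gene length includes the terminal stop codon (the gene sequence in this record ends in TGA).

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a gene span that
    includes the stop codon (both are assumptions, not stated in the record).
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no amino acid.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(491718, 492944)
print(gene_len, protein_len)  # 1227 408
```

Under these assumptions the result agrees with the Gene Length (1227 bp) and Protein Length (408 aa) fields.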
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.307892 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATCTA TTTTTATTGA TTGTAGTCTT GGAATATCTG GAGATATGCT CGCATCAGCT TTATTCGATT TAGGTGTTCC GCACTCTATT TTCTTGGATA ACTTAGTAAG TTTAAATATA GATAAAAATT ATAAACTAAA ATTTAAAGAG GGAGATAGTG AAGGCATAAA AGGTATTGTC TGTATGAAAA ATGAAATTCA ATTTAAGGAA TTATCTAGAA GTCTTAATGA GATAAAGAAC TTACTTCTCA ATTCAAGCTT GAATGATTAT GTGAAGAAAA AGTCTATTAA GGTTTTTGAA ATTCTTGCTG AAGCTGAAGC AGTTGTTCAC GGGAATCAAA TCTCAGATGT TCATTTTCAT GAACTAGGTT CAATAGATTC CATCCTTGAT ATTGTTAATG TTTGCTCAGC TATAGATTTT TTAAAGCCAT ACAAAATTTA TTTTTCAAAT CCACCTTCTG GGAAAGGAAT CGTATCCACT TCACATGGCC CTCTGCCTGT TCCAGTGCCT ACTGTTCTAG AGATAGCGAG GCAAAATGAA ATCCCATTAA TGGTGCTTGA TGATAAATAT TTTGGTGAAA TAACAACTCC TACTGGCATC GCATTGATAG CAACTTTTAT AGATAAGTTT GGTCAACCAA GTAATCTAAA TATTCAAAAT ATTGGTATTG GCTTAGGAAG TAAAAATATA TCTCGACCTA ACTTTTTACG TATTCTGCTA ATAGATGAAA ATGATGATTA TATGGAAAAT AATAAACCCT CTAATGAAAC TATAATTGCT CAGGAGGCTT GGATTGATGA TTCTACGCCT GAAGATGTTG CGGTTTTAAT AGATAGATTA AGGTCTGCAG GTGCCATAGA TGTTATTTGT TATTCGGTCG ATATGAAAAA AAATAGAAAA GGTATATGTA TACAAGCTAT TGTTTACCCA AAGCATAAAA ATTTACTGCG TGAAGTTTGG TTTAACTATA GTACAACAAT TGGAATAAGA GAAAATAAGA TTAGCCGCTG GATACTTCCA AGAAGAACAG TGAGTCATAA AACTAAATTT GGGACAGTTA ATGTTAAACA AGCAATGAGA CCAAATGGTC TTAATTCAAT AAAAATAGAA CATAAAGACT TGACTCGAAT AACTTTAAAT ACAGGAATTC CAATAGAAGA GATACGTCAG AAATTAATCA TAGAATTATC AGAATTTTAT GAAATCGATG ATTGGTCTTT TTTATGA
|
Protein sequence | MKSIFIDCSL GISGDMLASA LFDLGVPHSI FLDNLVSLNI DKNYKLKFKE GDSEGIKGIV CMKNEIQFKE LSRSLNEIKN LLLNSSLNDY VKKKSIKVFE ILAEAEAVVH GNQISDVHFH ELGSIDSILD IVNVCSAIDF LKPYKIYFSN PPSGKGIVST SHGPLPVPVP TVLEIARQNE IPLMVLDDKY FGEITTPTGI ALIATFIDKF GQPSNLNIQN IGIGLGSKNI SRPNFLRILL IDENDDYMEN NKPSNETIIA QEAWIDDSTP EDVAVLIDRL RSAGAIDVIC YSVDMKKNRK GICIQAIVYP KHKNLLREVW FNYSTTIGIR ENKISRWILP RRTVSHKTKF GTVNVKQAMR PNGLNSIKIE HKDLTRITLN TGIPIEEIRQ KLIIELSEFY EIDDWSFL
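
Both sequences are printed in space-separated blocks of 10 characters, so the spaces must be removed before any computation. As a minimal sketch (the helper names `clean_sequence` and `gc_percent` are hypothetical and not part of any database tool), the GC content field could be recomputed from the gene sequence like this:

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(blocked: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence above, used only as a short demo.
fragment = "ATGAAATCTA TTTTTATTGA"
print(len(clean_sequence(fragment)))  # 20
print(gc_percent(fragment))           # 15.0
```

Pasting the full Gene sequence field into `gc_percent` gives a value that can be compared with the GC content field of this record (31%).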