Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_01761 |
Symbol | |
ID | 4781084 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 167973 |
End bp | 169154 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640083440 |
Product | hypothetical protein |
Protein accession | YP_001014005 |
Protein GI | 124024889 |
COG category | [S] Function unknown |
COG ID | [COG3146] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.46624 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAATA TAAAAATCAA ATGGCATGCC ACAATCCAAG AAATTCCAAA AATTATTTGG AATAATTTCT TAAAAGGAAA TTCAACTCCT TTTTATAAAT GGGATTGGTT GAATGCATTA GAAAGATCAA AAAGTGTTAG TACAAAATAT GGATGGCAAC CATTATTTCT CTCTGCGTGG AGTGAAAATA ATTTAATTGC ATGTGCACCT CTCTATCTTA AATCTCATAG TTATGGAGAA TTTATTTTTG ATAATGCCTT TGTTCAACTA GCTCAAGATA TGGGACTTCA ATATTATCCC AAGCTAATAG GAATGAGTCC ATTAAGTCCA ATAGAGGGAT ATCGCTTTCT GTTTGCAGAA GGAGTTAATG ATAGAGACCT CACACAAATA TTAATCTCTG AAATTGATAG TTTTGCCAAA CAAAATGGAA TTCTTAGTTG TAATTTTTTG TATGTAGATC CTAAATGGAT GAAAGTAGCT GAATCTCTAA ATTGCGCTAA GTGGGTCAAC CAACAAAGTC TGTTGACATT GAATGAAGAA AAAAGTTTTT CTGATTTTTT ACAAAAATTC AATTCCAATC AACGTAGAAA TATTAAGAGA GAAAGAGAAA GCATAAAAAA ATGTGGAGTA AAAGTTGAAG CTCTTAGTGG GTCTCAAATA GATGTAATGA ATTTAAAAAA AATGCATTAT TTTTATCAGC TTCATTGTTC AAGATGGGGA GTATGGGGAA GTAAATACCT CACGGAATCA TTTTTTACTG AACTTAGATC AACAGAACTC AAAGAAAATA TTGTTTTATT TGACGCAAAA GAAGTAGGAA TTGATAAAAC AATTGGAATG TCCTTATGCG TGAAAAACGA AAATATGCTT TGGGGACGAT ATTGGGGTGC AGAAAAAAAT ATAGATAATT TACATTTTGA AGCTTGTTAT TACTCCCCGA TTGAGTGGGC AATAGCAAAT AAAATAAAAT ATTTTGACCC TGGAGCAGGA GGTAGTCACA AAAAACGCAG AGGTTTTATT GCTAAACCCA ATGCAAGTCT TCATAGATGG TACAACTTAC CTATGGATTC ATTAATTAGA GAATGGCTAC CAAGAGCAAA TAAGTTAATG CTTGATCAAA TAAACGCTAC AAATAATGAA GTACCTTTTA AGTTTGAAGA GCCAAAACTA TCAAATACAT AG
|
Protein sequence | MNNIKIKWHA TIQEIPKIIW NNFLKGNSTP FYKWDWLNAL ERSKSVSTKY GWQPLFLSAW SENNLIACAP LYLKSHSYGE FIFDNAFVQL AQDMGLQYYP KLIGMSPLSP IEGYRFLFAE GVNDRDLTQI LISEIDSFAK QNGILSCNFL YVDPKWMKVA ESLNCAKWVN QQSLLTLNEE KSFSDFLQKF NSNQRRNIKR ERESIKKCGV KVEALSGSQI DVMNLKKMHY FYQLHCSRWG VWGSKYLTES FFTELRSTEL KENIVLFDAK EVGIDKTIGM SLCVKNENML WGRYWGAEKN IDNLHFEACY YSPIEWAIAN KIKYFDPGAG GSHKKRRGFI AKPNASLHRW YNLPMDSLIR EWLPRANKLM LDQINATNNE VPFKFEEPKL SNT
|
| |