Gene Information |
Locus tag | NATL1_11011 |
Symbol | |
ID | 4781138 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1002751 |
End bp | 1004043 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 640084380 |
Product | hypothetical protein |
Protein accession | YP_001014924 |
Protein GI | 124025808 |
COG category | [S] Function unknown |
COG ID | [COG4487] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
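The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS includes a single terminal stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 1002751, End bp 1004043.
length = gene_length_bp(1002751, 1004043)
print(length, protein_length_aa(length))
```

Under these assumptions the sketch gives 1293 bp and 430 aa, matching the Gene Length and Protein Length fields listed above.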
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0193058 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGAAA TCAAATGTCC TGAATGTGGT AGTACGATCA GCATTGATGA AGATAGTTAC TCAAATATTA TTAAGCAGGT AAGAGATCAG GAGTTTGAAG ATGAAATGAA GAAAAGACTT GAAATTCTTG AAAGGGATAA ACAGAAATCC GTTGATCTTG CTATTCAAAA TCATCGTTTA CAAATGCAAG AGGCAGCATT TATAAATGAA AAAAAAATGC AAGGCCTTCA GTCTCAATTG ATTTCTGCCC AGGCTGAAAA AACTATGACT GTGAACAAGA TTAAACATAA CTTTGAAAAA GAGAGAGACT CACTTGAGTA TTTATTGGAG AAAACAAGAG AAAAGAATGA ATTTGATAAA AAACTTGCTG TATCTGACGC AATTACTGAA TTAAAAGAAG GCTATGAAAA ACTCAAAAAT AACTTAGATA AAGTTGAACT GCAAAAGGAA TTATCTGAAA AATCTTTGAA AATGAGATAT GAGATTCAGT TAAAAGATCG AGAAGATTTA ATTGAGAGAC TTCGTGATAT GAAAACAAAA TTGTCGACTA AAATGGTTGG TGAGTCTCTT GAGCAACATT GCGAAAATGA ATTTAATCGA ATAAGGGCGA CAGCGTTTCC TAATGCATAT TTTGACAAAG ATAATGATGC TAGTTTTGGT AGTAAAGGTG ATTATATCTT TCGTGACTGT GATAGTAAAG GGAACGAAAT AGTCTCTATT ATGTTTGAAA TGAAAAACGA ATGTGATAGC ACTTCTAGCA AGAAGAAGAA TGAGGATTTT CTTAAAGAGT TAAACAAAGA TCGTTTAGAA AAAAATTGCG AATATGCTGT ATTAGTTTCT TTACTAGAAT CTGATAATGA CTTATTTAAT GCTGGAATTG TTGATTTTTC ATATCGGTAT CCAAAAATGT ATGTTGTTCG TCCTCAATGT TTTTTACCAA TAATTTCTTT ATTAAGGAAT GCTTCACTTA AGGCTCTTGA GTATAAATCA GAACTAGCAG CTATTAAAGA GCAGAATATT GATATTACAA ATTTTGAAAA TAGTCTTGAA CTATTTAAGG ATTCTTTTGG TAAAAATTAT GCTCTAGCTT CAAAACGTTT TGAAACAGCA ATTATGGAGA TTGATAAATC TATTAATCAC TTGCAAAAAA CCAAAGATGC ATTACTTGGC GCTGATAGAA ACCTTAGATT AGCTAATGAT AAAGCACAAG ATGTTTCGGT TAAAAGATTA ACCAGAAATA ATCCTACGAT GAGAGAAAAA TTTAACTCAA TTAGAAAAAG TGATGCTGCA TAA
|
Protein sequence | MNEIKCPECG STISIDEDSY SNIIKQVRDQ EFEDEMKKRL EILERDKQKS VDLAIQNHRL QMQEAAFINE KKMQGLQSQL ISAQAEKTMT VNKIKHNFEK ERDSLEYLLE KTREKNEFDK KLAVSDAITE LKEGYEKLKN NLDKVELQKE LSEKSLKMRY EIQLKDREDL IERLRDMKTK LSTKMVGESL EQHCENEFNR IRATAFPNAY FDKDNDASFG SKGDYIFRDC DSKGNEIVSI MFEMKNECDS TSSKKKNEDF LKELNKDRLE KNCEYAVLVS LLESDNDLFN AGIVDFSYRY PKMYVVRPQC FLPIISLLRN ASLKALEYKS ELAAIKEQNI DITNFENSLE LFKDSFGKNY ALASKRFETA IMEIDKSINH LQKTKDALLG ADRNLRLAND KAQDVSVKRL TRNNPTMREK FNSIRKSDAA
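The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a rough sketch of how the blocks might be cleaned and a GC percentage computed for comparison with the GC content field; the helper names are hypothetical, and the example string is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace between blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a whitespace-grouped nucleotide string."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, used only as a short illustration.
example = "ATGAATGAAA TCAAATGTCC"
print(clean_sequence(example))
print(gc_percent(example))
```

Passing the full Gene sequence value to `gc_percent` would give a figure to compare against the GC content reported for this gene.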