Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_12151 |
Symbol | |
ID | 4779617 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1063716 |
End bp | 1065185 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640084494 |
Product | hypothetical protein |
Protein accession | YP_001015038 |
Protein GI | 124025922 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAATA AAGACTTACC ATCGATATGT GGGTATGAAT TTGCAATTAA CGAATTAGTA AATAGGGATA GAGATAGTAG ACAATTCAAA GAAAATTTAA ATTTCGAGAA AGTCAATTCA GGATTTGCAT GTGCCCTCCA TATGCATCAA CCAACAATCC CAGCAGGTGA GAATGGAGAG CTCATATCAC ATTTGCAATA TATGTTTGAG CACACCTCAG AAGGAGATAA TCACAATGCC GAACCATTTG CTCAATGCTA TAAACGTCTA GCTGAAATCA TTCCAAGCCT GATAAAAGAT GGACATGATC CGAAAATAAT GCTTGATTAT TCCGGAAATC TTCTATGGGG ATTTGAACAG ATGGGTCGAG AGGATATTCT ATCCTCGTTA AAACTACTGA CCTGTGATGA GACAATTTAT CCACATGTTG AATGGCTAGG AAGCTTTTGG AGTCATGCTG TAGCCTCTTC AACACCACCC TCTGACTTTA AATTACAGAT AACAGCTTGG CAACATCACT TTTCAGCATT GTTCGGAGAA GATGCATTAC GTCGAGTAAA TGGATTTTCT CTTCCTGAGA TGCATCTTCC AAACCATCCT GATGTTCTTT TTCAATTAAT AAAAGCTCTT AAAGAATGTG GGTATCGATG GTTAATGGTT CAAGAACACA GTGTTCAGAA TATAGATGGC TCAAGTCTCA GAGACGATCA AAAATATATT CCCAACATGC TCAAAGCTCA AGCAAGTAAT GGAGATACGA TCTCTATACT TTCACTAATA AAAACTCAAG GGTCAGATAC AAAGCTTGTT GGTCAAATGC AACCTTATTA CGAGGCACTA GGGCTATGTA AACAAAATTT AGGTCAACAT ATTATTCCGA AGCTGGTTTC TCAAATTGCT GATGGAGAAA ATGGCGGAGT GATGATGAAC GAATTTCCTC AAGCTTTTAT TCAGGCACAT AAAAGAATTG GCCCAAAAAC AAATATAAGT CCCACAATTG CTATGAACGG ATCTGAATAC CTCAACTTCC TAGAGACTTC AAATGTAGAT GAAGATACTT ATCCAGTGAT ACAAGCAATC GATCAACACA AAATATGGGG GAAAATATCC GGACCAATAA CACCAACAAA ATTTAAAAAA GCAATTGAAG TATTAAAAGA GGAAGATCAA TCTTTTTCTT TGAGTGGAGC TAGCTGGACT AACGACTTAA GTTGGGAAGA TGGATACAAT AACGTTTTAG AACCAATTTC AAAACTTAGT TCATATTTTC ACGAAACATT TGACCATTTA GTAGCTCAAA ATCCATCGCT AACAAAAACG CATAGTTATC AAAGAGCACT CCTTTACCTT TTGCTATTAG AAACTAGCTG CTTCCGTTAC TGGGGACAGG GGAAATGGAC TGATTACGCT AAAACGATCT TCGAAAAAGG CGAAGAGGTA CTTAGAAACA TAGAAATTTC ATCTAACTAA
|
Protein sequence | MKNKDLPSIC GYEFAINELV NRDRDSRQFK ENLNFEKVNS GFACALHMHQ PTIPAGENGE LISHLQYMFE HTSEGDNHNA EPFAQCYKRL AEIIPSLIKD GHDPKIMLDY SGNLLWGFEQ MGREDILSSL KLLTCDETIY PHVEWLGSFW SHAVASSTPP SDFKLQITAW QHHFSALFGE DALRRVNGFS LPEMHLPNHP DVLFQLIKAL KECGYRWLMV QEHSVQNIDG SSLRDDQKYI PNMLKAQASN GDTISILSLI KTQGSDTKLV GQMQPYYEAL GLCKQNLGQH IIPKLVSQIA DGENGGVMMN EFPQAFIQAH KRIGPKTNIS PTIAMNGSEY LNFLETSNVD EDTYPVIQAI DQHKIWGKIS GPITPTKFKK AIEVLKEEDQ SFSLSGASWT NDLSWEDGYN NVLEPISKLS SYFHETFDHL VAQNPSLTKT HSYQRALLYL LLLETSCFRY WGQGKWTDYA KTIFEKGEEV LRNIEISSN
|
| |