Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_05531 |
Symbol | |
ID | 4780318 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 500947 |
End bp | 502209 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640083830 |
Product | RNA polymerase sigma factor RpoD |
Protein accession | YP_001014380 |
Protein GI | 124025264 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain [TIGR02937] RNA polymerase sigma factor, sigma-70 family [TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.417716 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTAAAGG AAAAACAGAC ATCTAAAAAT ATTTCTATTA ATCAGGATCC TAAAAACAAT TCCAAAAAAA TTAAAAAAAC AGCTGCCAAA GTAAACGTGA CTAAAGAAAC CCAAACTCTT CCTCAAGAAT CTTCAAATGA TCTAAAAATT GATTTAGATC TAGAGGCGGA TAAATTAATT GCTGAAGCAA ATAAAGTGCC TGAAGCCGAT ATCGATCTAG ATGATGATGA CGATAATGCC TCTGTGCTAT CAAGTGCTCA AGAAGCAGCA GCTAAAGCCT TAGCAAGTAT AAAAATAGGG CCAAAGGGTG TTTATACAGA AGACTCGATT AGAGTTTATC TACAAGAGAT TGGTCGTATA AGACTTCTCA GGCCAGACGA AGAAATTGAA TTAGCCAGGA AGATAGCAGA TCTTCTTCAA TTAGAGGAAG AAGCTGCACA ATTTGAAAGT GAGAATGGAC ATTTCCCATC AGTCAAAGAA TGGGCAGTTC TTGCAGACAT GCCATTAACT CGCTTCCGAA GGCGATTAAT GCTAGGCAGA CGAGCAAAAG AAAAAATGGT GCAATCAAAT CTACGACTAG TAGTTTCAAT TGCCAAGAAA TATATGAATA GAGGTTTGTC TTTTCAAGAT CTTATTCAAG AAGGAAGTCT TGGTCTAATT CGTGCAGCTG AAAAATTTGA TCATGAGAAA GGATATAAAT TCTCAACTTA CGCTACTTGG TGGATAAGGC AGGCTATCAC TAGAGCAATT GCAGATCAAA GCAGAACAAT TCGTTTGCCT GTGCATTTAT ACGAAACAAT TTCAAGGATC AAGAAAACCA CTAAAGTTTT AAGTCAAGAG TTTGGAAGGA AACCAACCGA GGAAGAAATC GCCGAAAGCA TGGAAATGAC GATTGAAAAA CTCAGATTCA TTGCAAAAAG CGCTCAGCTC CCCATTTCTC TTGAGACCCC TATTGGGAAA GAAGAAGATT CTAGACTTGG TGACTTTATT GAAGCAGACA TAGAAAATCC TGAACAAGAT GTGGCAAAAA CCTTATTAAG AGAAGATTTG GAGGGTGTTC TCGCTACATT AAGTCCAAGA GAGAGGGATG TTCTAAGACT TCGTTATGGC TTAGATGATG GAAGAATGAA AACCCTTGAA GAAATTGGAC AAATTTTTGA TGTAACTAGA GAGAGAATCA GACAAATAGA GGCTAAGGCT CTCAGAAAAC TCCGCCATCC AAACAGAAAC GGGGTTCTTA AAGAATATAT CAAGCTAAAT TAA
|
Protein sequence | MLKEKQTSKN ISINQDPKNN SKKIKKTAAK VNVTKETQTL PQESSNDLKI DLDLEADKLI AEANKVPEAD IDLDDDDDNA SVLSSAQEAA AKALASIKIG PKGVYTEDSI RVYLQEIGRI RLLRPDEEIE LARKIADLLQ LEEEAAQFES ENGHFPSVKE WAVLADMPLT RFRRRLMLGR RAKEKMVQSN LRLVVSIAKK YMNRGLSFQD LIQEGSLGLI RAAEKFDHEK GYKFSTYATW WIRQAITRAI ADQSRTIRLP VHLYETISRI KKTTKVLSQE FGRKPTEEEI AESMEMTIEK LRFIAKSAQL PISLETPIGK EEDSRLGDFI EADIENPEQD VAKTLLREDL EGVLATLSPR ERDVLRLRYG LDDGRMKTLE EIGQIFDVTR ERIRQIEAKA LRKLRHPNRN GVLKEYIKLN
|
| |