Gene NATL1_05531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_05531 
Symbol 
ID4780318 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp500947 
End bp502209 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content37% 
IMG OID640083830 
ProductRNA polymerase sigma factor RpoD 
Protein accessionYP_001014380 
Protein GI124025264 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family
[TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.417716 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAAAGG AAAAACAGAC ATCTAAAAAT ATTTCTATTA ATCAGGATCC TAAAAACAAT 
TCCAAAAAAA TTAAAAAAAC AGCTGCCAAA GTAAACGTGA CTAAAGAAAC CCAAACTCTT
CCTCAAGAAT CTTCAAATGA TCTAAAAATT GATTTAGATC TAGAGGCGGA TAAATTAATT
GCTGAAGCAA ATAAAGTGCC TGAAGCCGAT ATCGATCTAG ATGATGATGA CGATAATGCC
TCTGTGCTAT CAAGTGCTCA AGAAGCAGCA GCTAAAGCCT TAGCAAGTAT AAAAATAGGG
CCAAAGGGTG TTTATACAGA AGACTCGATT AGAGTTTATC TACAAGAGAT TGGTCGTATA
AGACTTCTCA GGCCAGACGA AGAAATTGAA TTAGCCAGGA AGATAGCAGA TCTTCTTCAA
TTAGAGGAAG AAGCTGCACA ATTTGAAAGT GAGAATGGAC ATTTCCCATC AGTCAAAGAA
TGGGCAGTTC TTGCAGACAT GCCATTAACT CGCTTCCGAA GGCGATTAAT GCTAGGCAGA
CGAGCAAAAG AAAAAATGGT GCAATCAAAT CTACGACTAG TAGTTTCAAT TGCCAAGAAA
TATATGAATA GAGGTTTGTC TTTTCAAGAT CTTATTCAAG AAGGAAGTCT TGGTCTAATT
CGTGCAGCTG AAAAATTTGA TCATGAGAAA GGATATAAAT TCTCAACTTA CGCTACTTGG
TGGATAAGGC AGGCTATCAC TAGAGCAATT GCAGATCAAA GCAGAACAAT TCGTTTGCCT
GTGCATTTAT ACGAAACAAT TTCAAGGATC AAGAAAACCA CTAAAGTTTT AAGTCAAGAG
TTTGGAAGGA AACCAACCGA GGAAGAAATC GCCGAAAGCA TGGAAATGAC GATTGAAAAA
CTCAGATTCA TTGCAAAAAG CGCTCAGCTC CCCATTTCTC TTGAGACCCC TATTGGGAAA
GAAGAAGATT CTAGACTTGG TGACTTTATT GAAGCAGACA TAGAAAATCC TGAACAAGAT
GTGGCAAAAA CCTTATTAAG AGAAGATTTG GAGGGTGTTC TCGCTACATT AAGTCCAAGA
GAGAGGGATG TTCTAAGACT TCGTTATGGC TTAGATGATG GAAGAATGAA AACCCTTGAA
GAAATTGGAC AAATTTTTGA TGTAACTAGA GAGAGAATCA GACAAATAGA GGCTAAGGCT
CTCAGAAAAC TCCGCCATCC AAACAGAAAC GGGGTTCTTA AAGAATATAT CAAGCTAAAT
TAA
 
Protein sequence
MLKEKQTSKN ISINQDPKNN SKKIKKTAAK VNVTKETQTL PQESSNDLKI DLDLEADKLI 
AEANKVPEAD IDLDDDDDNA SVLSSAQEAA AKALASIKIG PKGVYTEDSI RVYLQEIGRI
RLLRPDEEIE LARKIADLLQ LEEEAAQFES ENGHFPSVKE WAVLADMPLT RFRRRLMLGR
RAKEKMVQSN LRLVVSIAKK YMNRGLSFQD LIQEGSLGLI RAAEKFDHEK GYKFSTYATW
WIRQAITRAI ADQSRTIRLP VHLYETISRI KKTTKVLSQE FGRKPTEEEI AESMEMTIEK
LRFIAKSAQL PISLETPIGK EEDSRLGDFI EADIENPEQD VAKTLLREDL EGVLATLSPR
ERDVLRLRYG LDDGRMKTLE EIGQIFDVTR ERIRQIEAKA LRKLRHPNRN GVLKEYIKLN