Gene NATL1_21781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_21781 
Symbol 
ID4780313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1836223 
End bp1837245 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content41% 
IMG OID640085476 
Producttype II alternative sigma-70 family RNA polymerase sigma factor 
Protein accessionYP_001015998 
Protein GI124026883 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family
[TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.185298 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGATCC CGCTGGAATC TGCCAAGGAA ATTTCTGATG TTTCATTGGG AAAATCTAAT 
TCGGCAAGCA CAAGTCAAAA ACTATCTGCA TCAACTGTAA GTAGTCAAAA ATCTCGAAGT
TCTAGAAGAC AAAGTAATCG TTTAGCTACT GATGCGATAG GTTTCTATCT AACAAGTATT
GGCAGAGTCC CACTTCTGAC TCCTGCAGAA GAAATTGAGC TTGCTCATCA TGTTCAACAA
ATGAAAGATT TGTTGAACCT TCCTCTTGAA GAACGCTCTA CCCGTCAGAA GCACAAAATA
AAAATGGGTA AACGGGCAAG AGATCGCATG ATGGCTGCAA ATCTTAGGCT TGTTGTAAGT
GTTGCAAAAA AATATCAAAA CCAAGGTCTC GAATTACTCG ATTTAGTTCA AGAAGGTGCC
ATTGGATTAG AAAGAGCTGT CGATAAATTT GATCCTGCAA TGGGTTATAA ATTCTCAACC
TATGCTTACT GGTGGATAAG ACAAGGGATG ACTCGTGCCA TAGATAACAG TGCTAGAACT
ATTCGACTGC CAATACATAT CAGCGAAAAA CTTTCAAAAA TGCGCCGTAT CTCGCGTGAA
TTATCTCATC GATTTGGTAG GCAACCAAAT CGACTCGAGT TGGCAAACGC GATGGGAATT
GAACCTCAAG ATCTAGAAGA TCTAGTATCT CAAAGCGCGC CATGTGCCTC TCTTGATGCT
CATGCAAGAG GCGAAGAAGA TCGTAGTACT CTAGGAGAAC TAATTCCAGA CCCAAATTCT
GATGAGCCGA TGGAAGGGAT GGATAGAAGT ATTCAAAAAG AACACCTCGG AGGCTGGTTA
TCTCAATTAA ATGAGAGAGA GCAAAAAATC ATGAGACTTC GTTTTGGCCT TGATGGAGAA
GAACCACTTA CTCTTGCTGA AATTGGAAGA CAAATAAATG TTTCTAGAGA GCGTGTAAGA
CAACTTGAGG CTAAAGCAAT TTTAAAATTG AGGGTGATGA CTACTCATCA AAACGCTGCC
TAA
 
Protein sequence
MGIPLESAKE ISDVSLGKSN SASTSQKLSA STVSSQKSRS SRRQSNRLAT DAIGFYLTSI 
GRVPLLTPAE EIELAHHVQQ MKDLLNLPLE ERSTRQKHKI KMGKRARDRM MAANLRLVVS
VAKKYQNQGL ELLDLVQEGA IGLERAVDKF DPAMGYKFST YAYWWIRQGM TRAIDNSART
IRLPIHISEK LSKMRRISRE LSHRFGRQPN RLELANAMGI EPQDLEDLVS QSAPCASLDA
HARGEEDRST LGELIPDPNS DEPMEGMDRS IQKEHLGGWL SQLNEREQKI MRLRFGLDGE
EPLTLAEIGR QINVSRERVR QLEAKAILKL RVMTTHQNAA