Gene NATL1_06321 details

Gene Information

Locus tag: NATL1_06321
Symbol:
ID: 4779460
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 572599
End bp: 573522
Gene Length: 924 bp
Protein Length: 307 aa
Translation table: 11
GC content: 42%
IMG OID: 640083910
Product: type II alternative sigma-70 family RNA polymerase sigma factor
Protein accession: YP_001014459
Protein GI: 124025343
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02937] RNA polymerase sigma factor, sigma-70 family
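
The start, end, gene length, and protein length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes that the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; neither is stated on the page, although both agree with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 572599
END_BP = 573522
GENE_LENGTH_BP = 924
PROTEIN_LENGTH_AA = 307


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count, assuming the CDS ends in exactly one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 924
print(coding_codons(GENE_LENGTH_BP))   # 307
```

Both results match the Gene Length and Protein Length fields, so the record is internally consistent under these assumptions.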


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.903812
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTTCTT TAAGCGACTT TCTTGGAGAA ATAGGAAGAC ACGAATTGCT CACACCAGAG 
CAAGAACTCA CTTTGGGTAG AGAAGTGCAA GCAATGGTTG CACTTAATGA ACGCTGCCAA
CAAGCTGGAG GTAATGGACC AGAGTGTGAA TATTCAAGCG CAGAGAAGAA AAGATTTAAG
GCAGGTGAAC GGGCCAAGAA TCAAATGATC ACAGCAAATC TGAGGCTTGT TGTGAATTTG
GCCAAGCGCT ATCAAGGCAA AGGGCTTGAC CTTCTTGATT TAATTCAAGA GGGCACCTTA
GGATTAACAA GAGCTGTTGA GAAGTACGAT CCCAAAAGAG GTCATCGATT TTCAACTTAC
GCTTACTGGT GGATTAGGCA AGGACTAAAT AGAGCGCTTT CAACGCAAAG TAGAACAATA
CGAATCCCTG TAAACATAAA TGAAAAACTA ACAAAACTTC GTGCTGCCAA GTCTCGACTA
ATGCAAGAGC TTGGGGTTCA TCCCTCAACT AACCAGATAG CCATACAAAT GAAAATTCCT
TTAGAGGAAG TAGAAGAATT ACTTGCATGT GAGCTAAGAA GTATCACAGT GAGCTTGCAA
GGGGCAGTCA AATCTAAAGC AGATCCCTCT GAGCTTGTCG ATATTCTTCC AAGTGAAGAA
GTACCTCCCA TGGAACTAGC TGAATTAGCA GAGAGAAGTG CATCAGCCTG GTCGTTATTG
GACAAATCAA ATCTCACACC AAAAGAGAGA ACGATACTAA GCCTGCGATT TGGCCTAGAT
GGATCGAACG AATGGAGAAC TCTAGCCGAA GTCGCCAGAC AAATGAATTG CAGCAGGGAA
TATTGCAGAC AAGTAGTTCA ACGTGCATTA AGAAAGTTGA GAAAAACAGG GATTCAAAGT
GGTCTTTTAG AGACAAGTAT TTAA
 
Protein sequence
MSSLSDFLGE IGRHELLTPE QELTLGREVQ AMVALNERCQ QAGGNGPECE YSSAEKKRFK 
AGERAKNQMI TANLRLVVNL AKRYQGKGLD LLDLIQEGTL GLTRAVEKYD PKRGHRFSTY
AYWWIRQGLN RALSTQSRTI RIPVNINEKL TKLRAAKSRL MQELGVHPST NQIAIQMKIP
LEEVEELLAC ELRSITVSLQ GAVKSKADPS ELVDILPSEE VPPMELAELA ERSASAWSLL
DKSNLTPKER TILSLRFGLD GSNEWRTLAE VARQMNCSRE YCRQVVQRAL RKLRKTGIQS
GLLETSI
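
The gene and protein sequences above can be cross-checked with a short script. The sketch below is one possible approach, not the database's own method: it strips the 10-base block layout, computes a GC fraction, and translates the DNA with the standard amino-acid assignments, which translation table 11 shares. It assumes the first codon is a valid initiator and reads it as Met (GTG, used here, is among the start codons allowed by table 11). All function names are hypothetical, and only the first row of the gene sequence is used to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used in the page layout."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate a CDS, reading the first codon as Met (assumption)
    and stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First row of the gene sequence listed above.
first_row = "GTGAGTTCTT TAAGCGACTT TCTTGGAGAA ATAGGAAGAC ACGAATTGCT CACACCAGAG"
dna = clean(first_row)
print(translate(dna))              # MSSLSDFLGEIGRHELLTPE
print(round(gc_fraction(dna), 3))  # 0.433
```

The translated fragment matches the first 20 residues of the protein sequence. The GC fraction of this single row need not equal the 42% reported for the whole gene; passing the full 924 bp sequence to `gc_fraction` would give the gene-wide value.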