Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_21781 |
Symbol | |
ID | 4780313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1836223 |
End bp | 1837245 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640085476 |
Product | type II alternative sigma-70 family RNA polymerase sigma factor |
Protein accession | YP_001015998 |
Protein GI | 124026883 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family [TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.185298 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGATCC CGCTGGAATC TGCCAAGGAA ATTTCTGATG TTTCATTGGG AAAATCTAAT TCGGCAAGCA CAAGTCAAAA ACTATCTGCA TCAACTGTAA GTAGTCAAAA ATCTCGAAGT TCTAGAAGAC AAAGTAATCG TTTAGCTACT GATGCGATAG GTTTCTATCT AACAAGTATT GGCAGAGTCC CACTTCTGAC TCCTGCAGAA GAAATTGAGC TTGCTCATCA TGTTCAACAA ATGAAAGATT TGTTGAACCT TCCTCTTGAA GAACGCTCTA CCCGTCAGAA GCACAAAATA AAAATGGGTA AACGGGCAAG AGATCGCATG ATGGCTGCAA ATCTTAGGCT TGTTGTAAGT GTTGCAAAAA AATATCAAAA CCAAGGTCTC GAATTACTCG ATTTAGTTCA AGAAGGTGCC ATTGGATTAG AAAGAGCTGT CGATAAATTT GATCCTGCAA TGGGTTATAA ATTCTCAACC TATGCTTACT GGTGGATAAG ACAAGGGATG ACTCGTGCCA TAGATAACAG TGCTAGAACT ATTCGACTGC CAATACATAT CAGCGAAAAA CTTTCAAAAA TGCGCCGTAT CTCGCGTGAA TTATCTCATC GATTTGGTAG GCAACCAAAT CGACTCGAGT TGGCAAACGC GATGGGAATT GAACCTCAAG ATCTAGAAGA TCTAGTATCT CAAAGCGCGC CATGTGCCTC TCTTGATGCT CATGCAAGAG GCGAAGAAGA TCGTAGTACT CTAGGAGAAC TAATTCCAGA CCCAAATTCT GATGAGCCGA TGGAAGGGAT GGATAGAAGT ATTCAAAAAG AACACCTCGG AGGCTGGTTA TCTCAATTAA ATGAGAGAGA GCAAAAAATC ATGAGACTTC GTTTTGGCCT TGATGGAGAA GAACCACTTA CTCTTGCTGA AATTGGAAGA CAAATAAATG TTTCTAGAGA GCGTGTAAGA CAACTTGAGG CTAAAGCAAT TTTAAAATTG AGGGTGATGA CTACTCATCA AAACGCTGCC TAA
|
Protein sequence | MGIPLESAKE ISDVSLGKSN SASTSQKLSA STVSSQKSRS SRRQSNRLAT DAIGFYLTSI GRVPLLTPAE EIELAHHVQQ MKDLLNLPLE ERSTRQKHKI KMGKRARDRM MAANLRLVVS VAKKYQNQGL ELLDLVQEGA IGLERAVDKF DPAMGYKFST YAYWWIRQGM TRAIDNSART IRLPIHISEK LSKMRRISRE LSHRFGRQPN RLELANAMGI EPQDLEDLVS QSAPCASLDA HARGEEDRST LGELIPDPNS DEPMEGMDRS IQKEHLGGWL SQLNEREQKI MRLRFGLDGE EPLTLAEIGR QINVSRERVR QLEAKAILKL RVMTTHQNAA
|
| |