Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_09801 |
Symbol | |
ID | 4776901 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 891566 |
End bp | 892495 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640086488 |
Product | helix-hairpin-helix DNA-binding motif-containing protein |
Protein accession | YP_001016994 |
Protein GI | 124022687 |
COG category | [S] Function unknown |
COG ID | [COG1426] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACTAA ACCTTCCATG GCAAAAATCA AGATCTGAAG TCGAAACCAG ACAAGATCCA CAACAGATAT TCCAGAGCTC TTTACTGGAA GCTGGTCGTC AACTCAGAGA GCGTCGCGAA GAGTGTGGAA TGAGCCTGCG TGATTTAGCT GAAGAAACAC GAATCACTAC CCCCGTTCTA GAAGCAATTG AACGGGGGTG GGTCAAACGA CTACCAGAGC CAGCCTATCT GTGCTCAATG CTGCCGCTGC TGGAGCAGCA TCTTGAACTA GCCCCAGGCA GCCTCAACGG GGCCATGCCA GAACGCAATG CCAGAAACCA AACTCATTCC AACAGGGGCC TGACGCGATT CACCCCCGGC AGCATCGATG TCTTCACCAC CTGGCAAGGG AGTGTGATTT ATGCCGTGGT CATGCTCAGC AGCCTCTTGG CCCTCAATCA CCAGCAAAAG CATCTTGCTG CACTCAACAG TCAAACACTC TCGCCAATCA CAGTCAGCCT TGAATCCCTT GCTGATCAGC ACGCAAGCGA GACAGCCAAC CCCGCCCTTG ATGGCCTAAG ACCCCTGGAA GAAGCACGAA AACGATCTCC AGAACAATGG CTCAACGCAA CCCTTATCCA GCACCAAGCT CAGGACGAGA TAGGTCTACT GGAGATCAAT CTCAGTCAGC CACGTATGCT CAAAATCAAC AACGCTGGGA AAGATCTCAC AAATCTCAGA GAAGCTCAGG GGACACTCAC TCTTCAATTG CGGCCACCCC TCCTACTAGA GATCAAGCCT CCAGCTGCTC CTGAGGACAG CGTCATTTGG AAGGGCCAGG CGCATGCTCA CGAACCCAAT CATCCTGGGA TTTACCGACT CGAAGATGCC GTCAGCAAAG CGGCTGCCGA TTCAAGCGAA CGTCCCCAGA CAGCTCCTCT CTCGCCATAG
|
Protein sequence | MRLNLPWQKS RSEVETRQDP QQIFQSSLLE AGRQLRERRE ECGMSLRDLA EETRITTPVL EAIERGWVKR LPEPAYLCSM LPLLEQHLEL APGSLNGAMP ERNARNQTHS NRGLTRFTPG SIDVFTTWQG SVIYAVVMLS SLLALNHQQK HLAALNSQTL SPITVSLESL ADQHASETAN PALDGLRPLE EARKRSPEQW LNATLIQHQA QDEIGLLEIN LSQPRMLKIN NAGKDLTNLR EAQGTLTLQL RPPLLLEIKP PAAPEDSVIW KGQAHAHEPN HPGIYRLEDA VSKAAADSSE RPQTAPLSP
|
| |