Gene Information |
Locus tag | P9303_29251 |
Symbol | |
ID | 4777996 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2586596 |
End bp | 2588683 |
Gene Length | 2088 bp |
Protein Length | 695 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640088448 |
Product | 4Fe-4S ferredoxin, iron-sulfur binding protein |
Protein accession | YP_001018920 |
Protein GI | 124024613 |
COG category | [C] Energy production and conversion; [K] Transcription; [T] Signal transduction mechanisms |
COG ID | [COG0348] Polyferredoxin; [COG1221] Transcriptional regulators containing an AAA-type ATPase domain and a DNA-binding domain |
TIGRFAM ID | |
|
|
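The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is complete and unspliced with a single terminal stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2586596
END_BP = 2588683
GENE_LENGTH_BP = 2088
PROTEIN_LENGTH_AA = 695


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of residues encoded by a complete CDS.

    Assumes the CDS length is a multiple of three and that the last
    codon is a stop codon, which is not translated.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the listed gene length and protein length are consistent with the coordinates: 2088 bp corresponds to 696 codons, one of which is the stop codon.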
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTCCA CCACCCCCAG CCATCAATGC ATGATGCGGA TGGCTCCCTA CCTAATCGGT CGCCCACGCC GCGGCGTCGT GGGAGGCAGC CGTTATGCCT GCAAGTTACG GGAATCCATT CGCCAAGCAG CCAATGATCC AGAGCGCAAG CCAGTCTTAA TCAGTGGTGA ACCAGGCTTA GAAAAAGACA ACATCGCCAG ACTGGTTCAT TTCGGCTCTG CCGATCGACG CCTTCTTTTG ATGAGCTTCG ATGCCAGCAA CATACGAGGG CAAGGCGTTG AACTTTTTGG CAGAGAAGGC TCTCATGAAC TGTCTCTGCT CGATTGCCTA GGGGATGGCA ACCTGCTGAT CGACTGCATT GATCTTGCAG AGCCTCAACT GCAAGCACGT CTGATTGCCC TTGCAACTGA AGGCCACCCT GCCTTCTCGG GGCGGATCTT GTTTACTGCC GAATCAAGCC TCAAAGAGCT AGAGGGGGTA GCAACTCAGA TCAGAGTGCC ACCCTTACGA GTGCGACGAT CCGATCTGGG CGATTGGCTC CGTTACAGCC TGCGCCAGCG CAGCGCCAGT CTGGGCTGGA GTCGCCCTCC CGAGTTAACC GAAGCAATTG TCAGGCGACT GCAAAGCCAC GATTTCCCGA ACAACATCCG CGAATTGGAC AGCGTGGTGG AACGGGCCCT GCAACAGACC CGCAGCCAGG CCGCGGCTGA GCAAGCCAGC GGTGCAATCA CCATGGCCCT ACCCGCAGCT TTGCCGGAAG ACGTGTTCTG GGTGAACAGC CGGGAGCCCA ACCTGCGCTT TGAGATCTGG CGCTGGAAGC CTCAACTACG TCAGTTGATG CGCTCACCCC AACTCTGGAA CGGCTTGCTT TTTGGGCTGG TGAGCTGGGT CTTCGTGCTC GTGAATCTGT GGCTGTGGCT TGGCCCACAA GACAGGGCAC ACAACAGCAT GCTCAAATTT TTCTGGGCCT GGTGGTGGCC TCTGATCCTG CTGACTTACC CGCTAGTTGG GCGACTTTGG TGTGCAGTCT GTCCATTCAT GGTCTGGGGC GAAATTGCTC AGAACAGCAA AAAGGCTCTC GCAAAATTGA TCTCAACCTT GGGTTTACCT GCCGGTTGGT TGCAACCAAG GCTCTGGCCC CATGGCGATC ACGACAGTTG GGGCGCACCT GTTTTGGCGA CAGGGTTCGC AGCGATCCTG ATCTGGGAAG AGGTCTGGAA CCTAGAAGAA ACCGCCCGAC TCAGCAGTTG TCTCTTGCTA CTGATTACCT CAGGAGCTGT GCTGTGTTCA CTGGTCTTCG AAAAGCGATT CTGGTGCCGC TATCTCTGCC CAGTTGGCGG CATGAATGGC CTATTCGCAA AGCTTTCGAT CCTCGAATTA CGTGCGCAAT CGGGCACGTG CTCAGGGAGT TGCACGTCCT ACGCCTGTTT CAAAGGAGGT CCTGCTGAAG GCGAAGGTAT GGCGAGCGAA GGTTGCCCCC TGGGCACCCA TCCAGCCCAT CTCAGCGACA ACCGCAACTG TGTCCTTTGT CTGACCTGCG CCCAAGCATG CCCGCACCAC TCCGTGCAAT TGCGACTGCG GCCTCCAGCC GCAGATCTGC AACGCAGCAT GCATGGTCCA AAAGGCGAGA AGGGATTGAT CTTGGTGTTG GCCGGTGGAA TCACCCTGCA TCATTGGCAA CGACTACTGG GCTGGTTGCC CCTGGCTCCA GAGTCTCTCC AGGAGGGGCC CCTTTTAGCG CGACTGGTGT TTGCTGCACT AGCGCTAAGC CTGCCAGCAG CTGCAGGTCT GTGGGTGGAG 
CGTCGCTGGC TGTATACCGC CTTACCACTG TTATGGAGCG TGCTCTTGGC CCGTCATCTA CCGATTGGCA TGGCAGAGGC CGGCACTGTG TTGCCGATTG GTTGGCCTCA ATGGAGCGCC GATCCCAACG TGATTGGCTT CTGCCAGAGC TTCGCAATTG CGGTTGGTTG GCTTGGTTGT GTGGTGTTAC TTCGGCGTCT AATCAGCCGC CAGCGACAAC CCTGGCTCAC AGCTAGCGGA GCATTCTTGG TCTTGGCGCT GGCTAGCCGC TGGGTCGTGC ACATCTAA
|
Protein sequence | MESTTPSHQC MMRMAPYLIG RPRRGVVGGS RYACKLRESI RQAANDPERK PVLISGEPGL EKDNIARLVH FGSADRRLLL MSFDASNIRG QGVELFGREG SHELSLLDCL GDGNLLIDCI DLAEPQLQAR LIALATEGHP AFSGRILFTA ESSLKELEGV ATQIRVPPLR VRRSDLGDWL RYSLRQRSAS LGWSRPPELT EAIVRRLQSH DFPNNIRELD SVVERALQQT RSQAAAEQAS GAITMALPAA LPEDVFWVNS REPNLRFEIW RWKPQLRQLM RSPQLWNGLL FGLVSWVFVL VNLWLWLGPQ DRAHNSMLKF FWAWWWPLIL LTYPLVGRLW CAVCPFMVWG EIAQNSKKAL AKLISTLGLP AGWLQPRLWP HGDHDSWGAP VLATGFAAIL IWEEVWNLEE TARLSSCLLL LITSGAVLCS LVFEKRFWCR YLCPVGGMNG LFAKLSILEL RAQSGTCSGS CTSYACFKGG PAEGEGMASE GCPLGTHPAH LSDNRNCVLC LTCAQACPHH SVQLRLRPPA ADLQRSMHGP KGEKGLILVL AGGITLHHWQ RLLGWLPLAP ESLQEGPLLA RLVFAALALS LPAAAGLWVE RRWLYTALPL LWSVLLARHL PIGMAEAGTV LPIGWPQWSA DPNVIGFCQS FAIAVGWLGC VVLLRRLISR QRQPWLTASG AFLVLALASR WVVHI
|
| |
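The relationship between the gene sequence and the protein sequence above can be illustrated by translating the first ten codons. This is a minimal sketch, not the database's own method: it assumes the space-separated 10-base blocks are display formatting only, and it uses the standard amino-acid assignments, which translation table 11 shares with the standard code for elongation codons. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is removed first, so the blocked layout used in the
    record can be pasted in directly. Trailing bases that do not form
    a full codon are ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in this record.
    prefix = "ATGGAGTCCA CCACCCCCAG CCATCAATGC"
    print(translate(prefix))  # MESTTPSHQC
```

The printed residues match the first block of the protein sequence listed in the record (MESTTPSHQC).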