Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_14761 |
Symbol | |
ID | 4777006 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1270222 |
End bp | 1272027 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640086986 |
Product | hypothetical protein |
Protein accession | YP_001017487 |
Protein GI | 124023180 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.1592 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCTGCGA CCAAAACTTG TTCCACTACG CAGGTGCCCT TGGTGGTCTG GGGTGGAGGC ACAGGAGGCG TCGCCTCAGC CGTACAAGCT GCTCGACATG GAATTAGAAC TCTGCTGTTA ACACCAGGCG CGTGGCTTGG TGGAATGGTC AGTGCAGCAG GCGTTTCGGC TCCAGACGGC CATGAACTCA GTTGCTGGCA GACAGGCTTG TGGGGGGCCT TCCTTCAAGA CATCGCCAAA GTTGAACAGG ATGGACTGGA TCAGAACTGG GTGAGTTGCT TTGGTTTTAA TCCAGCTCGC GCAGAAGAGG TTTTGCGGAG CTGGGTTGTC GATCTCCCCA ATCTCGAATG GTGGTCTGAC GTTCAGCTCA AATCACTGTT ACGTGAGGGG GATCGCCTTG TTTTGCTTGA GCTGGAGCTT GAAGGGCGAC AGCATCAACT GCGCTTCGAC TTATTCATTG ATGGAAGTGA TTTAGGCGAC AGCTTTTCAC TCGCAGACAT TTCACACCGT TGGGGTTGGG AATCACAAGA ACTCTGGAAT GAGCCAAGCG CCCCTTCTGC AAAACAATTG AATTGCGATC CATTCTTTGC CAGCCAGCCT GTTCAATCAC CTACTTGGGT TGTGATGGGT CAACTCGATC AACATGCACA ATCACCTTCA TCACCAGGGT GCTTGCCTCG ACCATTCGAA GGTGCAACTG AGGCGTTTGG TTTTGAGCGC ACCGTTACCT ATGGCCGCCT TCCAGGCGGT TTAGTAATGC TCAACTGGCC CCTTCATGGA AACGACTGGC ATAAAGGCTT GGAGCGTATA CGCAGCTCAG ACGTCAAAAT TAAAAACCAG CTTGCTGCAG AAATGCAGCT GTATAGCTTG TCATTCCTAC AAGCTCTTCA AACAGTTAGT GAGGGTTGGC TTAAGCCTGG CAAGGTTTTT CCAGGGATCA ATCAAAGCCT GGCATTGATG CCCTATTGGC GAGAAGGTCG ACGCCTTAAA GGTCAATACA CCTTAGTGGA AGGAGATCTT CTGCCATTAG CCTCTGGTGC AGCCCGAGGA CCAATCCCTT TAGATAAACA GGGACGTTGC ACCAGTATTG CTGTAGGGAC TTACGCTAAT GATCATCATT ATCCTGGTAA AGATTGGCCC TTAGCCTCAA AAAGTTGTAG ATGGGGTGGG AGATGGACAG GTACTCCATT CTGCATTCCT TACGGGGCTC TACTGAGTAG CGAAGTGGAG AATATTCTGA TAGCTGATAA GGCTTTTAGT GTGAGTCATA TAGCCAATGG TGCAACTCGA TTACAGCCAA TGATTTTTAA TCTTGGCCAG GCTGCTGGTA TGGCAGCAGC TATAGCCTTG AAAAAGAGGC TTCAACCAGC AGAAGTTGAT ATTTCAGAAA TACAGCATGA ACTATTGCAT GATATTTATG CCCCAGCCGC TATTGTTCCA ATTTGGGATT GGCCTGCATG GCATCCTCAC TGGCGTAATG CACAGAAATT TGTTTTGGCG CAACCAGAGT GCCTCAGCAA CCATAGCGTG ATTGAAGGTT TTAATCTTAA AGCAAATGTA GGAGAAATGC CACTGCCAGA TCAAACACCA CTCAATAAAC ATGTGAAAAA GTTTTCAGGC TACCTTCGTA TTCATTCTGA TCAGACATTC TCTCTTGATA CCAAGTTTAA ATCCTGGCGA CTAATAACAC TGGAGCCGGC ATTAAAGCGG TGGCTGGAAT CATGTGAAGA CCAGCAGAAT GTACACCTTC TAGCAGTTGC CAATCCATGG GGACCATGGC TGAGGTTGAT TCGCATTTTA GATTAA
|
Protein sequence | MSATKTCSTT QVPLVVWGGG TGGVASAVQA ARHGIRTLLL TPGAWLGGMV SAAGVSAPDG HELSCWQTGL WGAFLQDIAK VEQDGLDQNW VSCFGFNPAR AEEVLRSWVV DLPNLEWWSD VQLKSLLREG DRLVLLELEL EGRQHQLRFD LFIDGSDLGD SFSLADISHR WGWESQELWN EPSAPSAKQL NCDPFFASQP VQSPTWVVMG QLDQHAQSPS SPGCLPRPFE GATEAFGFER TVTYGRLPGG LVMLNWPLHG NDWHKGLERI RSSDVKIKNQ LAAEMQLYSL SFLQALQTVS EGWLKPGKVF PGINQSLALM PYWREGRRLK GQYTLVEGDL LPLASGAARG PIPLDKQGRC TSIAVGTYAN DHHYPGKDWP LASKSCRWGG RWTGTPFCIP YGALLSSEVE NILIADKAFS VSHIANGATR LQPMIFNLGQ AAGMAAAIAL KKRLQPAEVD ISEIQHELLH DIYAPAAIVP IWDWPAWHPH WRNAQKFVLA QPECLSNHSV IEGFNLKANV GEMPLPDQTP LNKHVKKFSG YLRIHSDQTF SLDTKFKSWR LITLEPALKR WLESCEDQQN VHLLAVANPW GPWLRLIRIL D
|
| |