Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_12771 |
Symbol | |
ID | 4777319 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1097629 |
End bp | 1098789 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640086785 |
Product | hypothetical protein |
Protein accession | YP_001017289 |
Protein GI | 124022982 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0568502 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCAAGC TCAGCAACAC CGAACCCTGG CTAAAAACCT TCCGCAATCT CGTCCGCAAG ACGACTGCTG AGGATTGGTG GATTGTGAAA TCTGGCAATC GCATACGGCT TCAGGTTCCT GGTGTTGGCA GCAAAGTGTT GCCCTACGAC TGGTCGGAAG AAGGTGCAGC CCATGCCTTG CCCCGCATTC AGCAGATCTT TAAGCGTTGG GCTGATGGAA ACATCACACT TGCAGAGGCA GCGCAAACTG CTGATACCAG CAGTTCCAAG CAGCAACTGA ACTTCGACGA GCTGATCGAG AGCTATAGAA AGTTCGTCCC CAACGCTGGC GATAAAACCT GGAAGAAGAA TTATCTGCCC GTACTGCGCA ATTGCCGAGA CAAGTTCAAA GGAACCCCAC CACGTGATGG CGAAATACTC TGCATGGGAT GCCTTGAGCA ATGGGAGCAA GGCTCCAGGT CCCGTCAGAT CAGCCGTCAA AAGCTTTATG GCTTTCTGAC TTGGGCGGTG CAACGTGGTC ACCTAAAAAC GATCTACTTG CCTCCTACGT CCCTGCCAGA AGTGCGTAAG GCCAAGCGCG TTGGCTATGC CATTTCAGAT GTAGAGATCC TCAGGTTGTT GGAAGGAATG CCTGATCCAC GTTGGCAATT TGCTGTTCAG CTCTGCAGCG TCTATGGCCT CAGGCCAGAA GAATTGCGAT GGCTACGGAT CAAAAACGGG GCAAAGGGTT CTGAACTGTG GACCATCTAT CAAAAGTCGA TGGGCGGCAG GAAAGGAGAT AAGACAGAAC CGCGTCGCTT GCTACCGCTA CTCGTTCGTG ATCTTGACGG CTCTTCCATT GATTGGAAGT TGCAAGCCCG GCTTCAAGTT GGCGAAAAGT TACCCCCACT GCAGAGCGAT GGCGATGGAG CACAGGCATT AAGGAACTAC CTACGTCGGC GTGAGGTTTG GAGGAGCTTG AAAACTGAGG CGTTGAACAC AGGTGAACAG CTCACGACGT ATTCCTTTAG GCATCGCTAT GCCAAGGCTT CACATGCAGC CGGTTTGCCT GTGGCCAATA TCGCTGAGGC CATGGGGCAC ACGATTGAAG TGCATCTCGG TAGTTACGCC AGGTTCAAAC CAGATGCAAC AGCAGACCTT TATGCGCAGG TGAACGCTTA A
|
Protein sequence | MGKLSNTEPW LKTFRNLVRK TTAEDWWIVK SGNRIRLQVP GVGSKVLPYD WSEEGAAHAL PRIQQIFKRW ADGNITLAEA AQTADTSSSK QQLNFDELIE SYRKFVPNAG DKTWKKNYLP VLRNCRDKFK GTPPRDGEIL CMGCLEQWEQ GSRSRQISRQ KLYGFLTWAV QRGHLKTIYL PPTSLPEVRK AKRVGYAISD VEILRLLEGM PDPRWQFAVQ LCSVYGLRPE ELRWLRIKNG AKGSELWTIY QKSMGGRKGD KTEPRRLLPL LVRDLDGSSI DWKLQARLQV GEKLPPLQSD GDGAQALRNY LRRREVWRSL KTEALNTGEQ LTTYSFRHRY AKASHAAGLP VANIAEAMGH TIEVHLGSYA RFKPDATADL YAQVNA
|
| |