Gene Information |
Locus tag | P9303_17441 |
Symbol | |
ID | 4778113 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1524788 |
End bp | 1526464 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640087251 |
Product | hypothetical protein |
Protein accession | YP_001017751 |
Protein GI | 124023444 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTGAATT TGGCGGACAG CACATCCACT CAGATCCTGT TGCTGGCTCC CGATCTGCTC GGTGAATCCT TGGCGTTGCA GCTCAGCAGT GCAAACCCAA ACCTGGATGT CATTCTGCGG ACGGACCAGC TGAGCCGTCA TCCAGTCCTA GTGATCTTGT CGGTGGAGAG TCTCGAAACT CTCAGCACAT TGCAACTGGA ACTGAAAAGA CTTCAGGAGC ATTGGCAACC TGCGCCAGTA ATGCTGATCC TTCCGGCCCA GCTTCGTTTC AATGCCAACG AGTTGTTGAG CCTCGATTGT CCAGGCCTAC TTCAAGACCC TGATCTGGCC ACATTGCAGG ATGCCATCAC CACCCTTTGT GCCGGGGGCA GAGTTGTGAG ACTCAATGCT GCCCCCGTCT CCCAAGACAG CATTCCCCAA GCAACGATGG GACTTGGTCA GTGGCTGTTG GTGAGTGGTC TCCAGCAGAT CGACAACGAT CTCAGGTTGA TCGAAGCATT CCTCAACCCA CCGCCCCAAA ATGAACTGTT TCGTCTTTTG ATGGAAGGGC GCCAGCGGGA ACTCCGCAGT GCAAGAGATT TTTTGTTATG GATTTGGGGC CCGCTGCAAG TGGGACTACG CAACCCTTTC CCCCCAAACC GGCCAGCCCA AAAGGCACGC ATCAATTTCG ATTTCGATGC TTCAAGACAG ATCTCAGCAG AAGCTGCTGG AACGGTGATC TGCCTCACTG AACGGAACGC AGTGGCGGTA TGGGGAGCGA TCCGTCAGCG GCTTAGCGAT TCCGTAGAAA GAGGACTCAG AAATTCAACA GGCAGCCTGC TAGCGATTGA AAGCCTCAAT CCTGAGCGAC GTCGTGATCT ACTCCTTGCC CTTTTGAACC AATTGGATCA AGTGATGAAA AGGCTGCGTC AAGCCAACAG TACCGAGACC CCACTCAATG ACTCCTGGCT GACGCTCGAA TCTGAACTAC GTGAACAAGC CCTGCGATCC ATGGCAGGGA ATTACGTCAG ACTTCCTCGA GGTGGTGAAC TCAAGCCAGT GGCAGATCAA TTGCTTGCCA CTGCCGATCT CAAGGGAATT GATCAAGAAC TTCCAGATCC CCAGAGAATG CTGGCTCCAT TGCTGCTCGA CAGACCCGTG CTTGTTGAGG GGCAGCTGCT GCCGGCAGAT GCCCCTCGGG CCTTACTGCA ACTCGAAATG TTGGTGGGTA ACTGGCTCGT ACGAACCGCT GAAATAATCA GTGCCGAAGT TCTCGGAACC TGTGGTGAGT GGCCCGAACT ACGCCGATTC CTACTTAACC AGCACTTGAT CTCCACACGA GAACTCGAGC GATTGCGTAA TCAACTCAAT AGTCAGGCCC GTTGGCAAAA CTGGATTCAA AGACCAATCC AGCTCTACGA AAGTAAACGA CTCCTCTACA GGTTGCGAGA CGGCATTATC GAGCCATTAC TGCTCACCGA ACCTCGTGAT GAAGAACTCA GCCAGCTTGG TTGGTGGCAG CAGCAAGTTG CTCTATTGCT CGAAGCCCGC GATGCTCTGG CACCCTCAAT GCAATCCCTG ATCAAACGCA TTGGTGATCT CATGGTGGTC GTGCTTACTC AGGTATTAGG CCGTGCTATT GGTCTGGTTG GACGAGGAAT CGCGCAGGGA ATGGGACGCA GCCTGAGAGG TGGCTAA
|
Protein sequence | MVNLADSTST QILLLAPDLL GESLALQLSS ANPNLDVILR TDQLSRHPVL VILSVESLET LSTLQLELKR LQEHWQPAPV MLILPAQLRF NANELLSLDC PGLLQDPDLA TLQDAITTLC AGGRVVRLNA APVSQDSIPQ ATMGLGQWLL VSGLQQIDND LRLIEAFLNP PPQNELFRLL MEGRQRELRS ARDFLLWIWG PLQVGLRNPF PPNRPAQKAR INFDFDASRQ ISAEAAGTVI CLTERNAVAV WGAIRQRLSD SVERGLRNST GSLLAIESLN PERRRDLLLA LLNQLDQVMK RLRQANSTET PLNDSWLTLE SELREQALRS MAGNYVRLPR GGELKPVADQ LLATADLKGI DQELPDPQRM LAPLLLDRPV LVEGQLLPAD APRALLQLEM LVGNWLVRTA EIISAEVLGT CGEWPELRRF LLNQHLISTR ELERLRNQLN SQARWQNWIQ RPIQLYESKR LLYRLRDGII EPLLLTEPRD EELSQLGWWQ QQVALLLEAR DALAPSMQSL IKRIGDLMVV VLTQVLGRAI GLVGRGIAQG MGRSLRGG
|
| |
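The coordinate and length fields in this record can be cross-checked against each other. The sketch below is an illustration only, not part of the database record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene span includes a single terminal stop codon (the gene sequence above ends in TAA, which is consistent with that assumption).

```python
# Minimal consistency checks for the record's numeric fields.
# Assumptions (not stated by the record itself): coordinates are
# 1-based and inclusive, and the CDS span includes one stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose span includes the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring spaces and other non-ACGT characters."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


length = gene_length(1524788, 1526464)
print(length)                           # 1677
print(expected_protein_length(length))  # 558
```

Both values agree with the Gene Length and Protein Length fields. Passing the full gene sequence string (spaces included) to gc_fraction gives a value to compare with the reported GC content.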