Gene Information |
Locus tag | P9303_09171 |
Symbol | |
ID | 4778730 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 833380 |
End bp | 834948 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640086426 |
Product | hypothetical protein |
Protein accession | YP_001016933 |
Protein GI | 124022626 |
COG category | [S] Function unknown |
COG ID | [COG1543] Uncharacterized conserved protein |
TIGRFAM ID | |
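
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length counts the terminal stop codon (the listed gene sequence ends in TAG). The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for P9303_09171.
length = gene_length_bp(833380, 834948)
print(length, protein_length_aa(length))  # prints: 1569 522
```

Both results agree with the Gene Length (1569 bp) and Protein Length (522 aa) rows of the record.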
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.127848 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCCAAAG GTGCTCTGGC CCTGGTTCTC CATGCCCATC TTCCATATGT GCGATCGGCA GAGCCCGGTT CATTGGAAGA AGACTGGTTT TTTCAAGCCC TAATTGAGTG CTATTTACCT CTTCTCCAGG TTCTGGAAGA AGCTGCCGCA GCACCCAACC AACACCCCAG GCTCACGATC AGCCTTTCTC CGACCCTGCT CTCCCTACTC AGTGACGACG ATCTAAAACA CCGATTCCCC GCCTGGCTCG CTGTTCGTCT GGACCTCCTA ACCCAAACGG CCTCAGACCT ACAACCTGCA GCGGACCACC TCGCTGAGAT CATCCAACGA AATCTGCATC AATGGTTGGC ATGCGAAGGC GATCTCATTG GTCGCTTTGC CCAACTTCAA AGGTCAAAGG TTGTCGACCT GCTCACCTGC GGAGCCACCC ATGGCTACAT GCCATTACTG AGGGAGCACC CTGAGGCAGT GCGTGGTCAG TTGCGCACTG CCGTACGCGA GCACCACCGC CTACTAGGAG AGCAGCCCCT GGGAATCTGG CTGCCGGAAT GTGCCTACTA CGAAGGGCTG GACCGTTGGA TACTCGATGC TGGACTGCGC TACACCGTGC TCGACGGGCA TGGCCTCCTT CATGCCACTC CTCGTCCGCG TTATGGCGTA TATGCCCCAA TCTGTAGCCG AAATGGCGTC GCCTTCTTCG GCAGAGACAG TGATGCCACG CTTCCTGTCT GGTCAGCCCA GCAAGGGTAT CCAGGCGACC CTTATTACCG TGAATTTCAT AGGGATCTTG GTTGGGACCT ACCAATCGAA CAACTCCATG ACATCGGGCT AAAGGAGCCC AGACCCTTAG GGCTGAAACT GCATCGAGTG ACAGACCAAA GGTCACCCCT TGATGCCAAA GAAGTTTATG AACCCGCCAT AGCTTGCGCA CTTACCAAAG AACACGCTCA GCTCTACTTA AAGGGTCGCC GTATCCAACT CGATCAATTA ACTAACACCA TGGCCATCGA GCCATTGTTA GTGGCTCCCT TCGACGCAGA GCTCTTTGGA CACTGGTGGT TTGAGGGGCC AACTTTCCTT GCCGAAATCT TCCGCCAGGC CAGCAAGGAG CAGGTGGATT TCACCAGGCT TCGAGACGTG CTCACATCAA ACCCCCAACT CCAACTTTGT GAACCATCTC CCTCAAGCTG GGGGCAAGGT GGCTACCACG ACTATTGGCT CAATGACAGC AATGCATGGG TGGTTCCTGA GTGGAGCCGA GCCGGAAAAG CAATGATGGA GAGATGCAGC CTAGGAGTGG CCCGAGAATC CGACCTACGG CTGCTGCAAC AGGCTGCTCG AGAACTTCTC CTGGCCCAGT CCTCCGACTG GAGTTTCATT CTGCGAGCAG GCACAACCAC GGAGCTGGCG AAGGAACGCA TCCATCGCCA CCTCAACCGT TTCTGGCAAT TAATGCAGGC CATCAATGAC AAGCAACATC TGCCCGAAGA CTTGCTGATC ACACTCGAAT CGGAAGATGG CCTCTTCCCA TTCATTCAAG CGACGGACTG GGCTCGCATT CGTGACTAG
|
Protein sequence | MAKGALALVL HAHLPYVRSA EPGSLEEDWF FQALIECYLP LLQVLEEAAA APNQHPRLTI SLSPTLLSLL SDDDLKHRFP AWLAVRLDLL TQTASDLQPA ADHLAEIIQR NLHQWLACEG DLIGRFAQLQ RSKVVDLLTC GATHGYMPLL REHPEAVRGQ LRTAVREHHR LLGEQPLGIW LPECAYYEGL DRWILDAGLR YTVLDGHGLL HATPRPRYGV YAPICSRNGV AFFGRDSDAT LPVWSAQQGY PGDPYYREFH RDLGWDLPIE QLHDIGLKEP RPLGLKLHRV TDQRSPLDAK EVYEPAIACA LTKEHAQLYL KGRRIQLDQL TNTMAIEPLL VAPFDAELFG HWWFEGPTFL AEIFRQASKE QVDFTRLRDV LTSNPQLQLC EPSPSSWGQG GYHDYWLNDS NAWVVPEWSR AGKAMMERCS LGVARESDLR LLQQAARELL LAQSSDWSFI LRAGTTTELA KERIHRHLNR FWQLMQAIND KQHLPEDLLI TLESEDGLFP FIQATDWARI RD
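
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the helpers below strip that formatting, compute a GC fraction, and translate a coding sequence with the codon assignments of translation table 11, which are identical to those of the standard code. Two simplifications are assumptions of this sketch: the first codon is always rendered as M (the record's gene starts with GTG while the protein starts with M), and the input is taken to be in coding orientation already, as the listed gene sequence is despite the "-" strand. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS up to the first stop codon.

    Assumption: the first codon is a valid initiator and is written as M.
    """
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append("M" if pos == 0 else amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
prefix = clean_sequence("GTGGCCAAAG GTGCTCTGGC CCTGGTTCTC")
print(translate(prefix))  # prints: MAKGALALVL
print(round(gc_fraction(prefix), 3))
```

The translated prefix matches the first block of the protein sequence. Running the same helpers on the full gene sequence would allow the Protein Length and GC content rows to be checked against the record.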