Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_07961 |
Symbol | |
ID | 5730127 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 700798 |
End bp | 702003 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641285160 |
Product | hypothetical protein |
Protein accession | YP_001550681 |
Protein GI | 159903337 |
COG category | [S] Function unknown |
COG ID | [COG1565] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.610301 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00421114 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGACAGCA AGCTTGCGCA ATGTCCTGAT TGGCTTGTTC ATCATTTTAC AGAAGCTGGA GGTGGTCTTA GCTTCTGCAA ATATATGAAT TTAGCGCTTA ATGACCCTGC TAATGGCGCA TATTCCACTG GCAAGATAAA TATAGGTATT AAAGGCGACT TTGTTACATC ACCTAGCTTG ACGCCGGATT TTGGGGAGTT GCTTTCATTT CAGTTAATTG AATGGTTAGA TCAATTGATG GCCTCTACTA AATTCTCAGA AAAGTTAGTT GTAATTGATA TTGGCCCGGG TGAAGGTGAT TTGACTTTTG ATTTGATAGC GGCTTTACAA AAATTCTCGC CATCCATGCT GAGAAGAATT CAGTTTATAT TAGTTGAGAT CAATGAGGGA ATGAAATTAC GACAGAAAAA GAAACTTGAG CAGTTCCCCT CTTCATTGAT AAGGTGGGCT AGCTTAGAAG AACTCTCTAG GACTTCACAA GTCGGAGTTA TCATTGCCCA CGAAATACTT GATGCATTAC CTGTCGAAAG AGTTGTCTAT AAGAACAATA AGTTGTATCA GCAAGGAGTT AAATTGATTG AAGATTCTGG TAATTACTTT TTAGACTATT TTGATTTGCC GTTACCGAAT AAGCTGAATC ACTTTATAAG AGATCTAAGC GATTATTGCA AGGTAAATAT TCCTCCTGAT AAAGCAGCCG AGGGATGGTC TACAGAATTA CATACGAATT TAAATAGTTG GTTCGAGAAA ATTTCTAAAT CTTTGGATTA TGGCCTTGTA TTAGTAATAG ACTATGCTTT AGAAGCAAAA CGTTATTACC ATGTTAATAG AGATTTAGGT TCAATAGTCT CTTATAAAAA TCAGTGCTGT ACTTTTAATG TCTTAAAAGA TGCCGGACTA TGTGACATTA CAAGCCATCT TTGTATAGAG TCGATGCAAA TTTATGCAAG TAAGCATAAT TTATTTAGTA AAGGCATAGT AAGGCAAGGT CAAGCTCTGT TGGCTTTAGG ATTGGCTGAT ATTTTAAGTT CTTTAGCACA GGCCGATAAT GTTGACTTGC CTACTGTGCT TAGAAGAAGA GAAGCTCTTC TTAGGTTGGT TGACCCTATT GCTTTAGGGG ATCTCAGGTG GCAGGTTTTT GAGTTAAATA AAAATAGAAA TGTATATAAT TTTGACATAA AATCAAGGCT GTTATATGAA CCTTAA
|
Protein sequence | MDSKLAQCPD WLVHHFTEAG GGLSFCKYMN LALNDPANGA YSTGKINIGI KGDFVTSPSL TPDFGELLSF QLIEWLDQLM ASTKFSEKLV VIDIGPGEGD LTFDLIAALQ KFSPSMLRRI QFILVEINEG MKLRQKKKLE QFPSSLIRWA SLEELSRTSQ VGVIIAHEIL DALPVERVVY KNNKLYQQGV KLIEDSGNYF LDYFDLPLPN KLNHFIRDLS DYCKVNIPPD KAAEGWSTEL HTNLNSWFEK ISKSLDYGLV LVIDYALEAK RYYHVNRDLG SIVSYKNQCC TFNVLKDAGL CDITSHLCIE SMQIYASKHN LFSKGIVRQG QALLALGLAD ILSSLAQADN VDLPTVLRRR EALLRLVDPI ALGDLRWQVF ELNKNRNVYN FDIKSRLLYE P
|
| |