Gene Information |
Locus tag | P9211_12581 |
Symbol | |
ID | 5730441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 1131220 |
End bp | 1132767 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641285627 |
Product | hypothetical protein |
Protein accession | YP_001551143 |
Protein GI | 159903799 |
COG category | |
COG ID | |
TIGRFAM ID | |
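The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions, though they agree with the figures in this record. The function names are hypothetical and exist only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 1131220
END_BP = 1132767
GENE_LENGTH_BP = 1548
PROTEIN_LENGTH_AA = 515


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# True when the reported lengths agree with the reported coordinates.
record_is_consistent = (
    span_length(START_BP, END_BP) == GENE_LENGTH_BP
    and expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
)
```

The strand field does not enter this check, since the span length is the same on either strand.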
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00206869 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGAAATTAT TCCAGCAATT GCTGGTTTTC CCTGCTGCTT TGGGATTGGT TGCGCCTTTG GCTGCAAATG CAGCTGAAGT CAATATGACT GATGTTTCTA AGTATGCGGC GAAAACAGCT AAAAGCATCA AAGCTCCTTC TAGTGCTCAA TTCTCAGACA TCGTTCCTGG GGACTGGGCC TATACATCTC TTAAAAACCT AAGTGCTAGC TACGGTTGTG TAGATAATGC CTACACTCAA AACCTTAATT CAGGTCTTGC TTTAACTCGT TATGAAGCTG CTGCATTAGT AAATGCATGC CTTGATAACG GTCTGGTTGC AAGTGGTGAA GGTCTTTCTT CTGATGCTTC TCTTCTTGCC GATGAGTTTG GCGTTGAGAT GGCAATTCTC AAAGGCCGTG TTGATGGACT TGAGTACAAG CTTAATGAGC TTAGTGCTGG TCAGTTCTCA TCAACTACCA AGTTGAATGG TAAGGCAGCT TTCGTTGTAG GTGCTGTTGA CTATGAAGAC AGTGCTGATA CAGCAGTTGA TCATGGCGAC AAGCTTACTG GTACTTACAG CTACAGGCTT GATGTAAATA CCAGCTTCAA TGGTAATGAC CGCTTGTATA CAAGAATCAT TACTGGAAAC ATGGATGGAA ATAATCCATG GGGCGATAAA GATGGTGGTA CTTACCTAGC TGTCGCTAAC GACCACGAGC AAGTTCTTGA GGTAGATAAG CTCTGGTATG AGTGGACCAA GGATGACCTT AAATTCTGGG CCGGTCCAAA GATCGAGAAC TACTACATGT TGGCAGCTTC TCCATCTATC TATAAGCCTG TTCAGAAGCA ATTTGCTTTA GGTGGTAACA CTGCTGCTTA TGCTTCAAGC ACAACAGCAG GTTTTGGTGT CGCTTGGACA CAGCCAACTG AGGCTGATAG AAAGTGGACA GTTAGCTCTA ACTACGCCTC TGTTGGAGCT GATGATGCTA CCAAGGGTAT CCTTACCGAT GAGCAAACTA AGTGGCTGAC TCAAGTTACT TACGGTGGTA AGAGATGGCA GATCGCTGCT GCTTATGCTC GCCATGGTTG CGCAGGCCAG GATGCAAACA GCTCTTGTAA AGCATGGTCT GACTACTATT CAACTGCTGC AGGCGACAAT GCAACTGGAG AAGGCGAAAA TGCTTATTCA TTGCGCTATT GGTGGAGACC AGCTGAAACA GGTATAATGC CTTCTATTCA GCTCGGTATG GATTACCGTG AGCTAGATGA TGCAGCTGAC ACAGAAGTTC AGAGTACTGC TGCTTGGATG GTTGGTCTTA ACTGGAAAGA CGCTTGGATC GATGGCAACA GAGCTGGAAT CGCTTTTGGT TCTCGTGAGC ATGCTACCGA TTATGCAGGT TCCGGTGACG ACGAAGCTGA TGACAACCTA GTTTGGGAAG CCTATTACGA TTACCAGTTG ACTGATGGAA TCACCATCAC TCCAGCTCTA TTCGGTGGCT CTCATGTCTA TGACGGTTCT AACGATGACA TCTTTGGTGC TCTAGTTCAG ACTGTATTTA AGTTCTGA
Protein sequence | MKLFQQLLVF PAALGLVAPL AANAAEVNMT DVSKYAAKTA KSIKAPSSAQ FSDIVPGDWA YTSLKNLSAS YGCVDNAYTQ NLNSGLALTR YEAAALVNAC LDNGLVASGE GLSSDASLLA DEFGVEMAIL KGRVDGLEYK LNELSAGQFS STTKLNGKAA FVVGAVDYED SADTAVDHGD KLTGTYSYRL DVNTSFNGND RLYTRIITGN MDGNNPWGDK DGGTYLAVAN DHEQVLEVDK LWYEWTKDDL KFWAGPKIEN YYMLAASPSI YKPVQKQFAL GGNTAAYASS TTAGFGVAWT QPTEADRKWT VSSNYASVGA DDATKGILTD EQTKWLTQVT YGGKRWQIAA AYARHGCAGQ DANSSCKAWS DYYSTAAGDN ATGEGENAYS LRYWWRPAET GIMPSIQLGM DYRELDDAAD TEVQSTAAWM VGLNWKDAWI DGNRAGIAFG SREHATDYAG SGDDEADDNL VWEAYYDYQL TDGITITPAL FGGSHVYDGS NDDIFGALVQ TVFKF
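The sequences above are printed in space-separated blocks of ten characters, so the spaces have to be removed before any computation. As a rough sketch of how a GC content figure like the one in this record could be derived from the gene sequence, the helper below strips whitespace and counts G and C bases. The function name `gc_percent` is hypothetical, and the way the database rounds its reported value is not stated in the record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (such as the 10-base block spacing used in this record)
    is ignored, and the comparison is case-insensitive.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Example on the first two blocks of the gene sequence only.
first_blocks = "ATGAAATTAT TCCAGCAATT"
first_blocks_gc = gc_percent(first_blocks)
```

Passing the complete gene sequence string to the same function gives the whole-gene value for comparison with the GC content field.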