Gene Information |
Locus tag | P9211_12591 |
Symbol | |
ID | 5731220 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 1132966 |
End bp | 1134513 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641285628 |
Product | hypothetical protein |
Protein accession | YP_001551144 |
Protein GI | 159903800 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
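The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon (the listed gene sequence ends in TGA). The function names `gene_length` and `protein_length` are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table for P9211_12591.
start_bp, end_bp = 1132966, 1134513
length = gene_length(start_bp, end_bp)
print(length, protein_length(length))  # prints "1548 515"
```

Both results match the table: 1548 bp for the gene and 515 aa for the protein.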
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.375416 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00229564 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAATTAT TCCAGCAATT GCTGGTTTTC CCTGCTGCTT TGGGATTGGT TGCGCCTTTG GCTGCAAATG CAGCTGAAGT CAATATGACT GATGTTTCTA AGTATGCGGC GAAAACAGCT AAAAGCATCA AAGCTCCTTC TAGTGCTCAA TTCTCAGACA TCGTTCCTGG GGACTGGGCC TATACATCTC TTAAAAACCT AAGCGCTAGC TACGGTTGTG TAGATAATGC CTACACTCAA AACCTTAATT CAGGTCTTGC TTTAACTCGT TATGAAGCTG CTGCATTAGT AAATGCATGC CTTGATAACG GTCTGGTTGC AAGTGGTGAA GGTCTTTCTT CTGATGCTTC TCTTCTTGCC GATGAGTTTG GCGTTGAGAT GGCAATTCTC AAAGGCCGTG TTGATGGACT TGAGTACAAG CTTAATGAGC TTAGTGCTGG TCAGTTCTCA TCAACTACTA AGTTGGACGG AACGGTAGCT TTCGTTGTAG GTGCTGTTGA CTATGAAAAC AGTGCTGATA CAGCAGTTGA TCATGGCGAC AAGCTTACTG GTACTTACAG CTACAAGCTT GACTTGAACA CCAGCTTCAA TGGTAATGAC CGCTTGCATG CAAGCATCAT GACCGGAAAC ATGGATGGAA ATAATCCATG GGGCGATAAA GATGGTGGTA CTTACCTAGC TGTCGCTAAC GACAACGAGC AAGTTCTTGA GATAGACAAG CTCTGGTATG AGTGGACCAA GGATGACCTT AAATTCTGGG CCGGTCCAAA GATCGAGAGC AACCAGATGT TGGCATCTTC TCCATCTATC TATAAGCCTG TTCAGAAGCA ATTTGCCTTC GGCGGTAATA CTGCTGCTTA TGCTTCAAGC ACAACAACAG GTTTTGGTGT CGCTTGGACA CAGCCAACTG AGGCTGATAG AAAGTGGACA GTTAGTGCTA ACTACGCCTC TATTGGTGGT GACGATGCTA CCAAGGGTAT CCTTACCGAT GAGCAAACTA AGTTCCTGAC TCAGGTTACT TACGGTGGTC AGAGATGGCA GATCGCTGCT GCTGTTGCTC GCCATGGTTG CGCAGGCCAG GATGCAAACA GCTCTTGTCA CGCATGGTCT GACCTCTATG CAACTGCTGC AGGCGACAAT GCAACTGGAG AAGGCGAGAT GGCCTATTCA TTGCGCTATT ACTGGAAGCC AGTAGAAACA GGTGCAATGC CTTCTATTCA GCTCGGTATG GATTACCGTG AGCTAGATGA TGCAGCTGAC ACAGAAGTTC AGAGTACTGC TGCTTGGATG GCTGGTCTTA CTTGGGATGA TGCTTGGATC GATGGCAACA GAGCTGGAAT CGCTTTTGGT TCTCGTGAGC ATGCTACCGA TTATGCAGGT TCCGGTGACG ACGAAGCTGA TGACAACCTA GTTTGGGAAG CCTATTACGA TTACCAGTTG ACTGATGGAA TCACCATCAC TCCAGCTCTA TTCGGTGGCT CTCATGTCTA TGACGGTTCT GACGATGACA TCTTTGGTGC TCTAGTTCAG ACTGTATTTA AGTTCTGA
|
Protein sequence | MKLFQQLLVF PAALGLVAPL AANAAEVNMT DVSKYAAKTA KSIKAPSSAQ FSDIVPGDWA YTSLKNLSAS YGCVDNAYTQ NLNSGLALTR YEAAALVNAC LDNGLVASGE GLSSDASLLA DEFGVEMAIL KGRVDGLEYK LNELSAGQFS STTKLDGTVA FVVGAVDYEN SADTAVDHGD KLTGTYSYKL DLNTSFNGND RLHASIMTGN MDGNNPWGDK DGGTYLAVAN DNEQVLEIDK LWYEWTKDDL KFWAGPKIES NQMLASSPSI YKPVQKQFAF GGNTAAYASS TTTGFGVAWT QPTEADRKWT VSANYASIGG DDATKGILTD EQTKFLTQVT YGGQRWQIAA AVARHGCAGQ DANSSCHAWS DLYATAAGDN ATGEGEMAYS LRYYWKPVET GAMPSIQLGM DYRELDDAAD TEVQSTAAWM AGLTWDDAWI DGNRAGIAFG SREHATDYAG SGDDEADDNL VWEAYYDYQL TDGITITPAL FGGSHVYDGS DDDIFGALVQ TVFKF
|
| |