Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_11251 |
Symbol | |
ID | 5730301 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 1029604 |
End bp | 1031166 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641285493 |
Product | hypothetical protein |
Protein accession | YP_001551010 |
Protein GI | 159903666 |
COG category | [S] Function unknown |
COG ID | [COG1543] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.448443 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.521011 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACAAAAG GCAAACTAGC TCTAGTACTT CATGCACATC TCCCATATGT GAGATCTGCT AACCCTGGCT CTCTAGAAGA AGACTGGTTC TTTCAGGCAC TTTTGGAATG TTATTTACCT TTACTTGAAG TAATCGAAAA CTCAGTAAAT TCAAACGAAC AAAATCCGAA AATCACAATA TCCCTTTCAC CAACTCTTCT TTCTTTATTA AGTGATCAAG ATTTAAAAAA TCGTTTTCCA GACTGGGTAA AGTTAAGACT ATCTCTCTTA GAAAAAAGCA CTAAATCCCA AAATGAAGCA GGCAGATTCT TAAAGAAAAA CATTATTAAA CAATTAAAGA ATTGGTCCTC TTGTGATGGA GAAATTATTG ACAGATTCTC GCAACTAGAA AAGCTAAAAG CTCTTGACCT AATGACATGT GCCGCAACAC ATGGTTATCT GCCTCTTTTG CGAGAAACTC CTGAGGCTGT CAAAGGGCAA CTAAAGACTG CCATTCGAGA ACACCAAAGA TTTTTTGGCA AAAAACCTAA AGGTATTTGG TTGCCTGAAT GCGCTTACTA CGAAGGGTTA GACCATTTGA TGCGTGAATG TGATTTACGT TATTCAGTTT TAGATGGTCA TGGAATACTC CATGGGAAGC CAAGACCTAA ATATGGGATA TATGCTCCTG TTTGTACCAA AAATGGTATA GCTTTTTTTG CTAGAGATAG TGAATCAACT CTTCCTGTTT GGTCAGCCAG AGAAGGATAT CCAGGCAATC CTGAATATCG AGAGTTTCAC AAAGATTTAG GCTGGGATAT ACCGTTTGAA GAACTTGCGA AACTAGGCCT TGAAGGGAAT CGGCCTCTAG GCTTAAAGCT TCATAAAGTT ACTAGTAAGC AATCAATTAG AGAAAAAGAT TTATATGATC CAACATCAGC ATCCAAGCAG GTAAAAGAAG ATGCAAAAGA CTATTTAATA GGAAGAAAAA AGCAACTAAT AAAGCTAATC GATCAGACTG GGATAAATCC TTTATTAATA GCTCCTTTTG ATGCGGAGCT ATTTGGCCAT TGGTGGTTTG AAGGGCCAAT GTTTTTAGCT GAGATTTTTA AGCAAGCAAA GAATCATAAA GTCCAATTTA CGACTTTAAA GGATTACCTC AGCTCAAAAA AAGAGTTGCA GTTGTGTGAA CCCTGTCCTT CAAGTTGGGG GCAAGGTGGT TTCCACAATT ATTGGCTAAA TGAAACCAAT GCATGGGTAG TCAATGAATG GAGCAAGGCG GGGAAAGCAA TGGTTGAATG CTGCAGTAAT GGCGTCTCTA ATCAATTAAA TCTAAGGATC CTCCAACAAG CAGGAAGAGA GCTTCTTCTT GCTCAATCTT CTGACTGGAG TTTTATTCTT AAAGCAGGAA CAACTACAGA ATTAGCTAAA GAAAGAATTC ATCGTCACCT AAATAGGTTT TGGAATTTAA TGCAAGCAAT TGAATCAAAA AATGGAATGT CAGAGTCTGA ACTAAAAGAG TTAGAACTAG AAGATTCGAT ATTCCCTTTA ATCCATGCAA ATGATTGGTA CCAAATCTAT TAA
|
Protein sequence | MTKGKLALVL HAHLPYVRSA NPGSLEEDWF FQALLECYLP LLEVIENSVN SNEQNPKITI SLSPTLLSLL SDQDLKNRFP DWVKLRLSLL EKSTKSQNEA GRFLKKNIIK QLKNWSSCDG EIIDRFSQLE KLKALDLMTC AATHGYLPLL RETPEAVKGQ LKTAIREHQR FFGKKPKGIW LPECAYYEGL DHLMRECDLR YSVLDGHGIL HGKPRPKYGI YAPVCTKNGI AFFARDSEST LPVWSAREGY PGNPEYREFH KDLGWDIPFE ELAKLGLEGN RPLGLKLHKV TSKQSIREKD LYDPTSASKQ VKEDAKDYLI GRKKQLIKLI DQTGINPLLI APFDAELFGH WWFEGPMFLA EIFKQAKNHK VQFTTLKDYL SSKKELQLCE PCPSSWGQGG FHNYWLNETN AWVVNEWSKA GKAMVECCSN GVSNQLNLRI LQQAGRELLL AQSSDWSFIL KAGTTTELAK ERIHRHLNRF WNLMQAIESK NGMSESELKE LELEDSIFPL IHANDWYQIY
|
| |