Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_11211 |
Symbol | |
ID | 5731312 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 1024742 |
End bp | 1026817 |
Gene Length | 2076 bp |
Protein Length | 691 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641285489 |
Product | HD superfamily hydrolase |
Protein accession | YP_001551006 |
Protein GI | 159903662 |
COG category | [R] General function prediction only |
COG ID | [COG1480] Predicted membrane-associated HD superfamily hydrolase |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.465654 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.585119 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTTAAATA GCCAATCGCC AAGAAGGGTC ATAGTTCCAT GGGCATTAGC TGATAGAACA GGTTTGCTTA TGGTTTGCCT AGTAATAGCA ATAATTTCAA GCTATAAATT ATTAGATGTT CCTGACCTTA AGCCTGGTGA TATTGCCCCA TTTAACGCGA CTGCACCAAA AACCGCTTTT GTAATAGATA CTGCAGCTTT ACAACAGCAA AGAAAAGACC TTATACCAAG AACCTCAGTA CAAGTAATTG ACAAAGGAGA GTCTCAAAAA ATTCAAGAAG AGTTAATTAA ACAACTGGGC TATTTAGAAA AAGTTTCATC AGGCACTAAA GGTTCTAATT CTTTCAGGAT AGGTCCAGTA AACCTAACTC CAAAAGAAAG GGAGTGGCTG GCGAACCAAA CTACTAAATC AAGAAAACAA TGGGAAGGCG AAATAATTCT CCTGTCACAA AAAATGCTAA GTCAGGGCCT TATAAAGACA TTGGCTCATG ACCAAATACT AGAATCAGCT AAATTCCAAT TATCGGCCCA AGCTGAAAGC AATAATCCTT CTATAAACCT TGGCAGCAAA TTAATTGCAA ATACTTTATT TGGTAAAACA AATCTTCAGC ATGATGCTGC CAGAAGCCAG CAATTACTTG AAGAACTTAT AACCAAACAA GGAATCCCTG AGATAGAAGT TAAGAAAGGA GATCTAATCA CAAAAAAAGG GGAAAGAATC ACTGCTAAAG GATATGTTGT TCTTGATCAT TTTGGCCTAA TCAGAAGAAG TGCTAGACCA CTGGAATGGT TCGGAAAATT TAGTGAGGCA CTTGCAAGTT GTTTAGTTCT TTTAATGATA ATGAGAAGAG ATAAACCCTC TTTGAAACCT AAGAATGGAC TATTACCACT TGGTCTCCTA TTAATTGTTC AAGCAAGTAA AGACTGGTTT GGAGCTGCAA TCAGTCCTTT GCAAATTCTT GTTCCACCAA CATTACTTCT TTCCCAAGGC ATTAGTACCA CAACAGCCTT GGGATGGATG GCAATCGCTA GTTTGCTATG GCCCGTTCCC GTAAGTGGCA TTGGAGAAGG ACGTTTGATA ATTGCAGTCC TTACTTCAAC CTTAGTAGCG ATACTTGGTG GGCGAATGAG AAGCAGAGCC CAGCTTCTTC AGGTAGTAGT TTTTGTACCT TTTAGTGCAT ACCTTAGTCA GTGGGTTCTG TTAAGGAGTC AACTATTTTC TTCAGGTGGG GTGTGGAGGA AGCTTTCGCC TAATTCAGAG ACTTTGCTTA CTGAAGCTTT GGTTTTAGGG GCAATTTTGA TGTTTACTAT TTTATTAATA CCAATTATTG AAAATACTTT TGGCCTATTA ACACGCGCAA GGCTCATGGA AATATCCGAT CAAGAGAAAC CTCTTTTAAG AAGATTGTCT AAGGAAGCAC CAGGAACATT TGAACACACA TTGATGATCT GTGGACTAGC CGAAGAAGGT GCAAGAAGTA TTGGGGCTGA CGTTGACCTA ATTAGATCAG GAGCTCTATA TCATGACGTA GGGAAACTAC ATGCGCCAGA ATGGTTTATA GAGAATCAAG AGAATGGAGT CAACCCGCAT GAAACACTTG ACGACCCACT TAAAAGTGCT GATATTTTAC AAGCGCATGT AGATGAAGGC CTGAAGCTAG CTAGACGTTA CCGACTACCA ACTCCTATAG CTGATTTCAT ACCAGAACAC CAAGGGACAC TAAAGATGGG CTACTTCCTT CATAAGGCGA AAGAGAAGAA TCCCTCAGTA GAAGAAAAGC GTTTTAGGTA TAAAGGCCCC GTTCCACGCT CTAAGGAAAC TGCAATTCTT ATGATGGCCG ATGGATGCGA AGCAGCTCTT AGAGCTTTAG GGCCACAAAG TACTGATATA GATGCAGAGT CAACAATTAG AAAAATTATT CGCTCTCGTA AATTAGATGG ACAATTGCTA AAAAGTAGTA TTAGCAATTC TGAGATTGAA TTGGTTATAA GAGCCTTTAT AAGTGTATGG AGAAGGATGA GGCATCGGCG TATTAAGTAT CCTATTAGTT CATTTAAAGC ATCATATCCA GCTTAA
|
Protein sequence | MLNSQSPRRV IVPWALADRT GLLMVCLVIA IISSYKLLDV PDLKPGDIAP FNATAPKTAF VIDTAALQQQ RKDLIPRTSV QVIDKGESQK IQEELIKQLG YLEKVSSGTK GSNSFRIGPV NLTPKEREWL ANQTTKSRKQ WEGEIILLSQ KMLSQGLIKT LAHDQILESA KFQLSAQAES NNPSINLGSK LIANTLFGKT NLQHDAARSQ QLLEELITKQ GIPEIEVKKG DLITKKGERI TAKGYVVLDH FGLIRRSARP LEWFGKFSEA LASCLVLLMI MRRDKPSLKP KNGLLPLGLL LIVQASKDWF GAAISPLQIL VPPTLLLSQG ISTTTALGWM AIASLLWPVP VSGIGEGRLI IAVLTSTLVA ILGGRMRSRA QLLQVVVFVP FSAYLSQWVL LRSQLFSSGG VWRKLSPNSE TLLTEALVLG AILMFTILLI PIIENTFGLL TRARLMEISD QEKPLLRRLS KEAPGTFEHT LMICGLAEEG ARSIGADVDL IRSGALYHDV GKLHAPEWFI ENQENGVNPH ETLDDPLKSA DILQAHVDEG LKLARRYRLP TPIADFIPEH QGTLKMGYFL HKAKEKNPSV EEKRFRYKGP VPRSKETAIL MMADGCEAAL RALGPQSTDI DAESTIRKII RSRKLDGQLL KSSISNSEIE LVIRAFISVW RRMRHRRIKY PISSFKASYP A
|
| |