Gene Information |
Locus tag | P9303_22171 |
Symbol | |
ID | 4778053 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1966991 |
End bp | 1967803 |
Gene Length | 813 bp |
Protein Length | 270 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640087733 |
Product | haloacid dehalogenase/epoxide hydrolase family protein |
Protein accession | YP_001018217 |
Protein GI | 124023910 |
COG category | [R] General function prediction only |
COG ID | [COG0637] Predicted phosphatase/phosphohexomutase |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED |
|
|
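The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1966991
END_BP = 1967803


def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints "813 270"
```

Under these assumptions the result agrees with the Gene Length (813 bp) and Protein Length (270 aa) fields.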
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCAAC TGTTGCTGAG GGGGACGCCC ATCGGCAAGA TCCAAGGGGT GCTGTTCGAC AAGGATGGCA CCCTTTGCCA TAGCGAACCC CACCTGCTCA CTCTGGCAAA AGGAAGAATC GAACAAGCCA TTCGTAGGTT TCACCGAGGG AATGCCAGCG AAAGTGTGGT TTGCAAAATT CAGGAACTTC TCTCTGCTGC CTATGGACTC AACGCTGAAG GACTCGATCC AGGCGGAACC ATCGCCGTAG CGTCAAGGCA TCACAACCTA ATCTCCACAG CAACGGTGTT CTGTCTGCTC GGAGAAGGAT GGCCACAGGC TCTTGCTCTG GCCAATGAAG TGTTCGCAGC TGTGGATGCT CTTGAGAATG AAGTGCCCTG CCTAGCAACA ACGAGAACTC TGCTACCAGG TGCCCTTTCG CTCTTGCAAG CGCTGCGACA ACAAGGGGTG ATCTGCGCCG TGATTAGCAA TGACAGCGCC TCAGGGATTG AAACCTTCCT GAACCAAAAC AATCTTCATG ACACGGTGAC CGAGCTCTGG AGCGCTGAGC ATCAACCAGC TAAGCCCAAT CCAAATGCCG TCAAAAGACT CTGCCATTTA ATGGGATTAG CCCCCGCCCA GTGCGCCTTG ATTGGTGATG CCGATTCCGA TTTACAGATG GCTCGCCAGG CTGGTATAGG TCTCAGCCTT GGTTACATGG CCGGCTGGAA TCAACCCCCA ACACTGACGA ACTACCACCA CCTCATCCAC CACTGGAATG ATCTAGTCGT CAAGGCAGAC CCTAAAATTA CGCATAACTT CAGTTCTCCA TGA
|
Protein sequence | MPQLLLRGTP IGKIQGVLFD KDGTLCHSEP HLLTLAKGRI EQAIRRFHRG NASESVVCKI QELLSAAYGL NAEGLDPGGT IAVASRHHNL ISTATVFCLL GEGWPQALAL ANEVFAAVDA LENEVPCLAT TRTLLPGALS LLQALRQQGV ICAVISNDSA SGIETFLNQN NLHDTVTELW SAEHQPAKPN PNAVKRLCHL MGLAPAQCAL IGDADSDLQM ARQAGIGLSL GYMAGWNQPP TLTNYHHLIH HWNDLVVKAD PKITHNFSSP
|
| |
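The gene and protein sequences above can be cross-checked with a short script. The sketch below is an assumption-laden illustration, not the method the database used: it builds the codon table in the usual TCAG order (translation table 11 assigns the same amino acids to sense codons as the standard table), ignores the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here), and strips the whitespace that separates the 10-base blocks. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence from this record.
    print(translate("ATGCCTCAAC TGTTGCTGAG GGGGACGCCC"))  # prints "MPQLLLRGTP"
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field once the spaces in the latter are removed, and `gc_fraction` on the same input can be compared with the GC content field.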