Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_21871 |
Symbol | |
ID | 4777806 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1945062 |
End bp | 1946303 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640087702 |
Product | HD superfamily phosphohydrolase |
Protein accession | YP_001018187 |
Protein GI | 124023880 |
COG category | [R] General function prediction only |
COG ID | [COG1078] HD superfamily phosphohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.406295 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCTGC GCACCTATCA CGACCCTCTC CATCGCGGCA TCACCCTCAA TGCCAGCGAT GCAGCAGAGG CCATGGTGAT GCAGTTGATC GATGCGGCGC CATTCCAGCG GCTGCGCCGC ATCCGTCAGC TTGGGCCAGC CTTCCTCACC TTTCATGGAG CTGAGTCCAG TCGCTTCACC CATTCCCTTG GTGTGTTTCA TCTCGCACGC CTTGCTCTGG AAAGGTTGCT GCAGTTCGAC TCAGGCCTAG AGGAACATCG CGGACTGCTC TATGGAGCGG CCCTACTGCA TGACCTCGGC CATGCCCCCC TAAGCCATAC CGGAGAGGAA ATATTTGGTC TGGACCATGA GACCTGGTCG GCGCGTTTAG TCAGAGAACA TCCCGCCGTG CGTGCGCCCC TTGAGGCCTA CGCCCCGGGC ACTGCAGATG GAGTAGCCAA TCTTCTTGAA CGGGGCAGTT CACCCCACAG CGTGATCAAG GCTCTGGTTA GTAGCCAACT GGACTGCGAC CGACTCGACT ATCTCTTGCG AGACAGCTAC AGCACAGGCG CCCATTACGG CCAGCTCGAC CTCGAGCGAA TCCTCTCCGC CCTCACCCTG GCGCCTGATG GAGCCATGGC CATTCATCCA AAAGGCCTAA TGGCAGTTGA GCACTACCTG GTCGTACGCA ACCTGATGTA CCGCAGTGTC TATAACCACC GTCTCAATGT TGTTTGCAAC TGGCTGCTTG AGCAGGTGGT GCGCACCGCT CGCCAACTCG GAGCTGCTCA TGTTTGGGCC GACAAGATCA TGGCCACCTG GCTCTGGAGC CTCAATCAAC TCGACCTCGA CACATTCCTC GCCAATGATG ACCTGCGCAC CGGCTATCAC CTCCTGCGTT GGCAAGACGA GGGGCCTGCA CCTCTAGCCG AACTCTGCAA ACGCTTCCTC AATCGCCACC TACTGAAAGC CCTTGCAGTG GAACATCTGA GCCATAGCAA CCAGCTAGAG GTCCTCACTC TGACTCGGCA ACTAGCTGAA CGCCAAGGCT TCGATCCAGC GCTCTGCTGT GGATTACGCC ATCAGCAACA GCGCGGTTAT CACCCTTACA AAGGAGGGTT GCGTCTTTGG GATGGAAGGC AATTGCGAGC CCTCGAGCAA GCCTCTCCTC TGGTCGCCAG CTTGATTACC CCAGCCGAAT CTTCATGGTT GATTTATCCC CGAGAGATTC ATGGGGAGCT TCAAGCTGCG CTGGCAACCT AG
|
Protein sequence | MSLRTYHDPL HRGITLNASD AAEAMVMQLI DAAPFQRLRR IRQLGPAFLT FHGAESSRFT HSLGVFHLAR LALERLLQFD SGLEEHRGLL YGAALLHDLG HAPLSHTGEE IFGLDHETWS ARLVREHPAV RAPLEAYAPG TADGVANLLE RGSSPHSVIK ALVSSQLDCD RLDYLLRDSY STGAHYGQLD LERILSALTL APDGAMAIHP KGLMAVEHYL VVRNLMYRSV YNHRLNVVCN WLLEQVVRTA RQLGAAHVWA DKIMATWLWS LNQLDLDTFL ANDDLRTGYH LLRWQDEGPA PLAELCKRFL NRHLLKALAV EHLSHSNQLE VLTLTRQLAE RQGFDPALCC GLRHQQQRGY HPYKGGLRLW DGRQLRALEQ ASPLVASLIT PAESSWLIYP REIHGELQAA LAT
|
| |