Gene P9303_21871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_21871 
Symbol 
ID4777806 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1945062 
End bp1946303 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content58% 
IMG OID640087702 
ProductHD superfamily phosphohydrolase 
Protein accessionYP_001018187 
Protein GI124023880 
COG category[R] General function prediction only 
COG ID[COG1078] HD superfamily phosphohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.406295 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCTGC GCACCTATCA CGACCCTCTC CATCGCGGCA TCACCCTCAA TGCCAGCGAT 
GCAGCAGAGG CCATGGTGAT GCAGTTGATC GATGCGGCGC CATTCCAGCG GCTGCGCCGC
ATCCGTCAGC TTGGGCCAGC CTTCCTCACC TTTCATGGAG CTGAGTCCAG TCGCTTCACC
CATTCCCTTG GTGTGTTTCA TCTCGCACGC CTTGCTCTGG AAAGGTTGCT GCAGTTCGAC
TCAGGCCTAG AGGAACATCG CGGACTGCTC TATGGAGCGG CCCTACTGCA TGACCTCGGC
CATGCCCCCC TAAGCCATAC CGGAGAGGAA ATATTTGGTC TGGACCATGA GACCTGGTCG
GCGCGTTTAG TCAGAGAACA TCCCGCCGTG CGTGCGCCCC TTGAGGCCTA CGCCCCGGGC
ACTGCAGATG GAGTAGCCAA TCTTCTTGAA CGGGGCAGTT CACCCCACAG CGTGATCAAG
GCTCTGGTTA GTAGCCAACT GGACTGCGAC CGACTCGACT ATCTCTTGCG AGACAGCTAC
AGCACAGGCG CCCATTACGG CCAGCTCGAC CTCGAGCGAA TCCTCTCCGC CCTCACCCTG
GCGCCTGATG GAGCCATGGC CATTCATCCA AAAGGCCTAA TGGCAGTTGA GCACTACCTG
GTCGTACGCA ACCTGATGTA CCGCAGTGTC TATAACCACC GTCTCAATGT TGTTTGCAAC
TGGCTGCTTG AGCAGGTGGT GCGCACCGCT CGCCAACTCG GAGCTGCTCA TGTTTGGGCC
GACAAGATCA TGGCCACCTG GCTCTGGAGC CTCAATCAAC TCGACCTCGA CACATTCCTC
GCCAATGATG ACCTGCGCAC CGGCTATCAC CTCCTGCGTT GGCAAGACGA GGGGCCTGCA
CCTCTAGCCG AACTCTGCAA ACGCTTCCTC AATCGCCACC TACTGAAAGC CCTTGCAGTG
GAACATCTGA GCCATAGCAA CCAGCTAGAG GTCCTCACTC TGACTCGGCA ACTAGCTGAA
CGCCAAGGCT TCGATCCAGC GCTCTGCTGT GGATTACGCC ATCAGCAACA GCGCGGTTAT
CACCCTTACA AAGGAGGGTT GCGTCTTTGG GATGGAAGGC AATTGCGAGC CCTCGAGCAA
GCCTCTCCTC TGGTCGCCAG CTTGATTACC CCAGCCGAAT CTTCATGGTT GATTTATCCC
CGAGAGATTC ATGGGGAGCT TCAAGCTGCG CTGGCAACCT AG
 
Protein sequence
MSLRTYHDPL HRGITLNASD AAEAMVMQLI DAAPFQRLRR IRQLGPAFLT FHGAESSRFT 
HSLGVFHLAR LALERLLQFD SGLEEHRGLL YGAALLHDLG HAPLSHTGEE IFGLDHETWS
ARLVREHPAV RAPLEAYAPG TADGVANLLE RGSSPHSVIK ALVSSQLDCD RLDYLLRDSY
STGAHYGQLD LERILSALTL APDGAMAIHP KGLMAVEHYL VVRNLMYRSV YNHRLNVVCN
WLLEQVVRTA RQLGAAHVWA DKIMATWLWS LNQLDLDTFL ANDDLRTGYH LLRWQDEGPA
PLAELCKRFL NRHLLKALAV EHLSHSNQLE VLTLTRQLAE RQGFDPALCC GLRHQQQRGY
HPYKGGLRLW DGRQLRALEQ ASPLVASLIT PAESSWLIYP REIHGELQAA LAT