Gene A9601_03471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_03471 
Symbol 
ID4717036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp317130 
End bp318386 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content29% 
IMG OID640078051 
ProductHD superfamily phosphohydrolase 
Protein accessionYP_001008742 
Protein GI123967884 
COG category[R] General function prediction only 
COG ID[COG1078] HD superfamily phosphohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTATTA AAAGAATTTT TCATGACCCA ATTCATAAAG AAATAGTATT TGATGCAGGA 
AAGCCAGAAG AATTAATGAT TTTGGAATTA ATTGATACAG TCGCTTTTCA AAGACTAAGA
AGAATAAAAC AACTTGGAGC GGCATCATTA CTTTTTCATG GTGCAGAATC GAGTAGATTT
ACTCACTCAA TTGGTGTTTT TTGTATAGCT AGAAAAATTT ATAAGAGATT AATTGAAAAT
AAATCTTCTT TTAGTGAAAA TAAATTTGTT CTTTATGGAG CAGCTCTGCT ACATGATTTA
GGTCATGGAC CTTTAAGCCA TACCAGTGAA ACAATATTCA TGCATGATCA CGAACAATGG
TCTGAAAACT TAGTGACAAA TTATTCTCCA ATAACTTCAA TCCTCAAAAA GTATGACAAC
GAATTACCCA GAAAAATTGG TGAATTATTT CAATCAAAAC AACTATTTTC AAAACCTTTA
AGAACATTGA TAAGTAGTGA GATAGATTGT GATCGTCTTG ATTATCTTTT ACGCGATAGT
TACAACACAG GGACTAATTA TGGCTTAGTA GATTTAGAAA GAATAATTTC AGCTCTTACC
TTTTTACCTG ATGGAAATAT CGGAATCAAA CCAAAAGGAG TGATTGCCAT TGAGCATTTC
CTTGTACTTA GAAACTTGAT GTATAGAACA ATCTACAATC ACAAGATAAA CGAAATATCA
ACATGGATTC TGGGGAAAAT ATTACACACA ATAAAACATA ATTATGAAAA TAATATTTGG
TTAGATAATT CTCTACATAA ATGGATTTTT TCACCATCAA AAGTTGATTT TGATGATTTC
ATAAGAAATG ATGATGTAAC CTTTTATTAT CATTTAATCA GATGGAAAGA TGAATCTTTT
GAACCACTTT CTACACTATG CAAAATGTTT ATTGACAGGG ATTTATTAAA GGCATCAGAC
ATTAGTTTTT TAAGTAAGAT CAATAGATTA AAAATCCTTG CATTTGCCAC AAAATTATGT
GAAAAGAATG GTTATGATTC AGAAATATTT TGTGGGATTA AGGAAAGGTC TTTCAAAGGT
TTTGAATCTA ATAATGCTCT AAAAATATGG GACGGCACTT ATCAAAGCGC ATTAGAAAAT
AGTTCCGCAT TAATAAAAAC TTTAATGAGA TCCGATGAAA GCTCTTTTAT TATTTATCCA
GATATGATCA AAAATGAAAT CAAAAATGAA ATTTCATCGA TAAAAAACAA TTTCTAG
 
Protein sequence
MSIKRIFHDP IHKEIVFDAG KPEELMILEL IDTVAFQRLR RIKQLGAASL LFHGAESSRF 
THSIGVFCIA RKIYKRLIEN KSSFSENKFV LYGAALLHDL GHGPLSHTSE TIFMHDHEQW
SENLVTNYSP ITSILKKYDN ELPRKIGELF QSKQLFSKPL RTLISSEIDC DRLDYLLRDS
YNTGTNYGLV DLERIISALT FLPDGNIGIK PKGVIAIEHF LVLRNLMYRT IYNHKINEIS
TWILGKILHT IKHNYENNIW LDNSLHKWIF SPSKVDFDDF IRNDDVTFYY HLIRWKDESF
EPLSTLCKMF IDRDLLKASD ISFLSKINRL KILAFATKLC EKNGYDSEIF CGIKERSFKG
FESNNALKIW DGTYQSALEN SSALIKTLMR SDESSFIIYP DMIKNEIKNE ISSIKNNF