Gene P9303_28011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_28011 
Symbol 
ID4777733 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2466782 
End bp2468791 
Gene Length2010 bp 
Protein Length669 aa 
Translation table11 
GC content43% 
IMG OID640088324 
Producthypothetical protein 
Protein accessionYP_001018796 
Protein GI124024489 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGATGATC AGCATCATCA CTCAGCAAGA TTCGACCAGC AGGGATCGGA TGCTCCAGAA 
AATTTAATTA TTTTCATTCG AGATCAAGTC AAGCCGGAAG ATCTTTGGCT TCCGCGTGAA
TGGGCAGCAG AAAACCTGCC AACTCGTCAA TGGCTGGTCG ATAACGGACT GTCATTTACG
AACTCATTCA CGAATACTGC AATGTGTTCG GTCTCAAGGT CGACTTTTTT TACAAGTAAA
TTTCCGGCAC AACATCAAGC CGACCTTTTG CTTTCTGATA TTGATAGTTC ATATCTTAAT
GATCAGGTAC AACTCAATCC GGATTTTCCC AACCTGGCGA CAATCCTTAA AGACCAGGGA
TATGATGTTT CCTTTTTTGG CAAGGCACAT CTAAGCAAGA CATTTACATT GAACGATGGA
GAAGTTGTTT ACCAAGATAT GAATGCATAT GGCTTTGACG ATTGGGTGGG GCCAGATGCA
GGGCAGGACA TGAAGCCCGA AAATGCCGGC GGCGGTCCTA ATGATAATGA CGGACGCTTT
ACTTCTGAGG CAAAACAATG GCTACAAAAT CGTATTCAGT CAGACAATGA AAAACCTTAT
GCTTTGGTTG TTTCCTTAGT AAACCCTCAT GACGTGTTGT CTTATCCAAA GACCTATAAC
ACAGACTTTG AATATGATCG AAAATGGATT CATGGAGATA TCGAGATTCT CCCTCCAACG
GTAGATGAAG ATAAGGAAGA AACCTTGAAA CCAAGCGTTC AACGGCAATG GATGATTCCA
CAAAATGCTG GTCAGCCAAT GCCAACCGAT AAGATGAAAT TGAATTACCT CAACTTCTAT
GGAAATTTAA TGAAGAGAGC CGATTGGCAA ATGGGAGAAA TACTTGATGT TATTCGCGAT
TCAGACAATC CAGATGATGT TAATAATACG ATGATTGTCA GTACCAGTGA CCACGGTGAG
ATGGGCATGT CTCATGGTGG CATGGTTCAG AAGATGTTTA ATGCCTATGA TGAGAGTCTT
AAAGTGCCAA TGATTTGGTC AAATCCATCT TATTTTAAAG GCTCTCAAGA GAGTGACGCA
CTCATCTCTT TAATTGACTT TTTGCCTACA TATGCAAATT TCCAGGATTT TTCAGAAGAC
TATATTGCTC AACAGGATCT TCGCGGTGTA GATTATTCTT CGATTTTAAG GCGCGCCAGG
GAAGGTGAGT CCAAGAGCCT AGAGGGCTTG GATGTACAAG ACTCTATTTT ATACACTTAC
GATGATATCT ACGCTGGCCA AGATCCAGCT CTCTGCGAAG ATCCAGTTCA TGGCTTATTA
CCTGCAGCCA ACAGAATTCA GGCAGTTCGT ACAAAAGATT TTAAATACGC CCGCTATTAC
TCTGGGGACC AAGATTATGA ACCCGCAAAT TGGGAAGGTG AGCTTTATGA TTTAAGGCCT
GAAGGCGGCG ACTATTATCC AGATATTGAC CCAATTACCG GACAGCTAAA TCCTTTTAGA
GCAGCACCTT TAGAAGTGAG AAACCTTGAC CCTAAGGCAG AAACTCGTCG CAGACTTTTG
CAGAGATTTG GGATCGGTGA TGGCCCTATT GCAACCAAGA AACAGAAAAA GGCCTATTTA
GAGATGTCGG AGTTGCTTGA TCAGCAGATT GCTGACCGGC TACAACCTCT ACCTGAATCA
GATCCGATTA AACCATCTAT CTTTGTTTAT CAAGGCGGCT CTAGTGGTGA TCAGTCTGCC
TATAAAGTTG GGGACTCGAT TGTTCGCTTC ATCCCAAATA GTGAAGATGA GAAAGGCCTA
GAGTTGGCCT TTAATACAAG ATATGGTCAG ACATATAATC TTGTCTATTC GGAGCAAAGA
GATCCTTATG CTAGTTACAC TTATCTACCC TTCGAAACTA TAATCGGTAC TAATGGACCT
ACTTATCAGT ATCTTCCTGG CTTGTCAGCA GAGATGACCC TTGATCAGAT TTATATTCAG
TGGTCTGAAG GATTTGTCCC TCTCGCTTAG
 
Protein sequence
MDDQHHHSAR FDQQGSDAPE NLIIFIRDQV KPEDLWLPRE WAAENLPTRQ WLVDNGLSFT 
NSFTNTAMCS VSRSTFFTSK FPAQHQADLL LSDIDSSYLN DQVQLNPDFP NLATILKDQG
YDVSFFGKAH LSKTFTLNDG EVVYQDMNAY GFDDWVGPDA GQDMKPENAG GGPNDNDGRF
TSEAKQWLQN RIQSDNEKPY ALVVSLVNPH DVLSYPKTYN TDFEYDRKWI HGDIEILPPT
VDEDKEETLK PSVQRQWMIP QNAGQPMPTD KMKLNYLNFY GNLMKRADWQ MGEILDVIRD
SDNPDDVNNT MIVSTSDHGE MGMSHGGMVQ KMFNAYDESL KVPMIWSNPS YFKGSQESDA
LISLIDFLPT YANFQDFSED YIAQQDLRGV DYSSILRRAR EGESKSLEGL DVQDSILYTY
DDIYAGQDPA LCEDPVHGLL PAANRIQAVR TKDFKYARYY SGDQDYEPAN WEGELYDLRP
EGGDYYPDID PITGQLNPFR AAPLEVRNLD PKAETRRRLL QRFGIGDGPI ATKKQKKAYL
EMSELLDQQI ADRLQPLPES DPIKPSIFVY QGGSSGDQSA YKVGDSIVRF IPNSEDEKGL
ELAFNTRYGQ TYNLVYSEQR DPYASYTYLP FETIIGTNGP TYQYLPGLSA EMTLDQIYIQ
WSEGFVPLA