Gene P9211_03041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_03041 
Symbol 
ID5731519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp287531 
End bp289003 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content35% 
IMG OID641284651 
ProductRecB family nuclease 
Protein accessionYP_001550189 
Protein GI159902845 
COG category[R] General function prediction only 
COG ID[COG2251] Predicted nuclease (RecB family) 
TIGRFAM ID[TIGR03491] RecB family nuclease, putative, TM0106 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0542901 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTATCC CTCCAATAGA AAGCAAGATA ATTACAGATC GTTTGCTAAG CAGTTGGATT 
CGCTGCAAGC GTAAAGCCTG GTTGGATGTA TATGAAAATA AAGAAAGAAA AATTTGGCTA
GCTCACAGAA GCCTGCAACT TGATCACCAA TACAAAAGTC TCAAAGCATT TTCCTGTACA
AAGCCAGGTT ATGGCATTAA AGGTTGCCTT AAAGGAGAGG AAAACGTTGT TGGGATCAGA
CTAAAAACAA GCAATTTTTT TAATCATCAC ATAGAAGCTC ACCCTTTGAT ACTTCACAAA
ACAAAAGGGA ATAGTTGCTT TGGGAACTTC AAATACCAAC CCGTAATTGT CAGGCAAGGG
CGTAGAATTA CAAGAAATCA TAGACTTAGC CTTGCCTTAT GGGGGCATTT ACTAGAACAA
TTCCAGCAAT CATCTATAGA CGAAGGTTTA GCCATCTCAT TAACTAAAAA CGGCCTAGAG
ATAGATAAAG TTTTCCTTAG TAAGAAGCTT CACCAGCAAT TACTTAATTC AATAAATAAA
TTGATTATAG ACTTAGATAA AAAAGATCCG CCATCACTAA CCTCAGACAG AAAAAAATGT
GTGCTTTGCC CCTGGAATAA AGTATGCAAT AAGAAAGCTT TAGAAGAAGG TCATCTAAGT
GAAATCAATG GAATTGGTTC AAAGCGACAG GAAATTCTTC AATCAATTGG AATTAACAAT
ATTACAGAAT TAGCAAAAGC AAAGTCAATT TTTCTAAGCG ATAAGCTAGA CACTTTTGGG
GGCAAAAATG ACATGCTTTC CCATCAATTA ATTAATCAAG CTAAAGTACA GCTAAGCTCA
ATACCTAAGA GGATAAATAC AAACCCTGTC TTACCTGAAC TAAGCAATGT GCCTGGTGTT
ATTATTTATG ATATAGAATC TGATCCAGAT GCTAACCATG ATTTTCTTCA TGGTTTTATT
TCTATTAGGA AAACAGGAAT CAGGAATTGG GATGTAGATA ATTTTAATTA TAATAATATA
CTTACTATCT CTAATAAGGA TGAAAAAGAT ACTTTATTTG AAATTTATAA ACAATTTAAT
AATTTTAATT CCTGGCCCAT TCTTCATTAT GGAGAGACAG AATACTTGTC TTTTCAAAAG
ATGGCAAAGC GTCATGGGAT GCCAGATTTA GAATTAAATT CTATTCAAAA TAGATTTGTA
GATGTTCACG AAAGAGTAAG AAAACATTGG CTATTACCAG TTAATAGCTA TGGCCTCAAA
GGTGTTGCTC AATGGTTAGG TTTTAAATGG GACCAAAAGA ATGTGGATGG AGCTCAAGCT
TTACTATGGT GGAGACAATG GCGTTCAACA CAAAACAATT CAATAGTGCA CAAGGCTAAC
CTTAAAAAAC TCCTCAGATA CAATCAGGAT GACTGCATAG CAACCTGGGT TATAGCACAA
TGGCTTTTAA ATAATGATAG AGAACATAAA TAG
 
Protein sequence
MSIPPIESKI ITDRLLSSWI RCKRKAWLDV YENKERKIWL AHRSLQLDHQ YKSLKAFSCT 
KPGYGIKGCL KGEENVVGIR LKTSNFFNHH IEAHPLILHK TKGNSCFGNF KYQPVIVRQG
RRITRNHRLS LALWGHLLEQ FQQSSIDEGL AISLTKNGLE IDKVFLSKKL HQQLLNSINK
LIIDLDKKDP PSLTSDRKKC VLCPWNKVCN KKALEEGHLS EINGIGSKRQ EILQSIGINN
ITELAKAKSI FLSDKLDTFG GKNDMLSHQL INQAKVQLSS IPKRINTNPV LPELSNVPGV
IIYDIESDPD ANHDFLHGFI SIRKTGIRNW DVDNFNYNNI LTISNKDEKD TLFEIYKQFN
NFNSWPILHY GETEYLSFQK MAKRHGMPDL ELNSIQNRFV DVHERVRKHW LLPVNSYGLK
GVAQWLGFKW DQKNVDGAQA LLWWRQWRST QNNSIVHKAN LKKLLRYNQD DCIATWVIAQ
WLLNNDREHK