Gene P9303_24981 details

Gene Information

Locus tag: P9303_24981
Symbol:
ID: 4777294
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 2194318
End bp: 2195925
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 57%
IMG OID: 640088019
Product: RecB family nuclease
Protein accession: YP_001018494
Protein GI: 124024187
COG category: [R] General function prediction only
COG ID: [COG2251] Predicted nuclease (RecB family)
TIGRFAM ID: [TIGR03491] RecB family nuclease, putative, TM0106 family
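
The coordinate and length fields above can be cross-checked against one another. The following is a minimal Python sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon; neither convention is stated on this page. The function name check_gene_record is hypothetical.

```python
def check_gene_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if the coordinate and length fields agree.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the terminal stop codon (assumptions, not stated in the record).
    """
    span = end_bp - start_bp + 1      # inclusive span on the replicon
    codons = gene_length_bp // 3      # whole codons, including the stop
    return (
        span == gene_length_bp
        and gene_length_bp % 3 == 0
        and codons - 1 == protein_length_aa
    )


# Values copied from the Gene Information fields above.
print(check_gene_record(2194318, 2195925, 1608, 535))  # prints True
```

With the values listed here, the span is 2195925 - 2194318 + 1 = 1608 bp, which is 536 codons: 535 residues plus one stop codon.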


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTGCCA CCCCTCTCGC TGCCAACGTT CTGACTGATC GCTTGCTGCG TAGTTGGCTG 
CGCTGTCGTC GTAGGGCTTG GCTAGATCGT TATGGCGATG GAGAGCAACG GCTCTGGACT
GCTCACCGCA CTCTGCAACT TGATGATCAG CAGCGCAACT TTGTTGCTTT GTTGCCGCGC
AAACCATGCA GGGGTCTAGA CGGTTGCAGT CAAGGCTGCC CAGGGGTGGT GGGGCTGAGG
CTTAAGGGGG TTGGTCCTGC GGGGCAGTTA TTGGAGGCCC ATCCGCCATT GCTGCAGAGG
GTCGAGGGGC AAAGTCGTTG GGGGGCATTT GCTTACCGAC CGGTGCTTGC ACGTCAGGGT
CGACGCTTGA CCAGAGAGCA TCGCTTGGCT TTGGCCCTTG CTGGTCGTTT ATTGGCACCG
CTGCAGTCGG CTCCCGTGCC TGAAGGCTTG GCTTTGGCCG GAGCTGGTCG CAGTCTTCAC
ATGGAACGAG TTTCATTGCT GGGCGGGCTG CAGCGGCAGC TTGATGATGT CCTGGTCAAG
TTGGCCGCAG ACCTCGAGCT GAGTGAGCCT CCGCCTTTGG TTGCTGATCG GCGTAAGTGC
AAGTTGTGTT CCTGGCGAGG TGTTTGCAAT GCCGTGGCTT CTGTAGAGGG ACATCTCAGT
GAGGTGAGCG GTATCGGGAC TCGTCGACGG CAGATGCTTC AGGAACTGGG GATCCTTGGT
TTGCAGGATT TAGCGGCAGC CGATCCGAAT GAGCTCGGAA GTCGTTTGCA ACATTTCGGT
GAGCAGCACG GGGAAGTGGC TTGTGAGCTT GTCGCTCAGG CCCGGGCTCA GCGGGATGGT
CGTTATGAGC GATTGGACTC CGCATCAGCT TTGCCGGAAT TGGCCACTGC CCCTGGCGTG
TTGTTGTACG ACATCGAATC TGATCCAGAT GCTCGCGATG ATTTTCTGCA TGGTTTTGTC
CGGCTGGGCC GCAGGCCAGA TGGCAGTTGG GATTTAGAGG GCGCGCAGTA TCACCCCTTT
TTGGTGCTTT ATGAGCACGG CGAGGCACGT TGCTGGCAGC GGTTGCAACG CATGCTGAAG
AGTTATCCCG ACTGGCCAGT GATGCATTAC GGCGAAACGG AGTCTCTAGC TCTTCGCCGT
ATGGCTAAGC GGCAGGGAGT GGACGCGGCT GAGTTGAGTG CACTGAGCAA GCGCATGATT
GATGTGCACG ATCGGGTGCG GCGTTCTTGG CGATTGCCTT TAAACAGCTA TGGGTTGAAG
TGCGTGGCGA GTTGGCTGGG ATTTTGTTGG CGTCAGGTGG GTGTCGATGG GGCTCGAGCT
CTGCTTTGGT GGCGCCAGTG GCGTGGTTCA GGTCTTCAAG ATCGCGGCAG TTCCTATGCC
CTGCGTTGGA TCTTTGATTA CAACCACGAT GATTGTCTCG CCACTTGGGC CGTGGCGGCA
TGGCTGTTAA AGCAAGACGA CCTGTTAAAG CAAGACGACC TGTTAAAGCA AGACGACCTG
TTAAAGCAAG ACGACCTGTT AAAGCAAGAC GACCTGTTAA AGCAAGACGA CCTGTTAAAG
CAAGACGACC TGTTAAAGCA AGACGACCTG TTAAAGCAAG ACGAGTAG
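
The gene sequence above is printed in blocks of 10 bases, 60 per line, so it needs to be stripped of whitespace before any computation. The following Python sketch shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names clean_sequence and gc_percent are hypothetical, and the sketch makes no assumption about how the database rounds its reported value.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in an already-cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from above.
first_line = "ATGGGTGCCA CCCCTCTCGC TGCCAACGTT CTGACTGATC GCTTGCTGCG TAGTTGGCTG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full sequence text to clean_sequence should yield a string whose length matches the Gene Length field.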
 
Protein sequence
MGATPLAANV LTDRLLRSWL RCRRRAWLDR YGDGEQRLWT AHRTLQLDDQ QRNFVALLPR 
KPCRGLDGCS QGCPGVVGLR LKGVGPAGQL LEAHPPLLQR VEGQSRWGAF AYRPVLARQG
RRLTREHRLA LALAGRLLAP LQSAPVPEGL ALAGAGRSLH MERVSLLGGL QRQLDDVLVK
LAADLELSEP PPLVADRRKC KLCSWRGVCN AVASVEGHLS EVSGIGTRRR QMLQELGILG
LQDLAAADPN ELGSRLQHFG EQHGEVACEL VAQARAQRDG RYERLDSASA LPELATAPGV
LLYDIESDPD ARDDFLHGFV RLGRRPDGSW DLEGAQYHPF LVLYEHGEAR CWQRLQRMLK
SYPDWPVMHY GETESLALRR MAKRQGVDAA ELSALSKRMI DVHDRVRRSW RLPLNSYGLK
CVASWLGFCW RQVGVDGARA LLWWRQWRGS GLQDRGSSYA LRWIFDYNHD DCLATWAVAA
WLLKQDDLLK QDDLLKQDDL LKQDDLLKQD DLLKQDDLLK QDDLLKQDDL LKQDE
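
The protein sequence can be reproduced from the gene sequence by codon translation. The following Python sketch is an illustration only, not the database's own pipeline. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard genetic code and differing only in the start codons it permits. The gene starts with ATG, so no special handling of alternative start codons is included. The names CODON_TABLE and translate are hypothetical.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; "*" marks a stop codon.
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate DNA in frame 0, stopping at the first stop codon.

    Whitespace is ignored; codons with unknown bases become "X".
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Start of the gene sequence: matches the first 10 residues listed above.
print(translate("ATGGGTGCCA CCCCTCTCGC TGCCAACGTT"))  # prints MGATPLAANV
# Final line of the gene sequence: ends at the TAG stop codon.
print(translate("CAAGACGACC TGTTAAAGCA AGACGACCTG TTAAAGCAAG ACGAGTAG"))  # prints QDDLLKQDDLLKQDE
```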