Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_24981 |
Symbol | |
ID | 4777294 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 2194318 |
End bp | 2195925 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640088019 |
Product | RecB family nuclease |
Protein accession | YP_001018494 |
Protein GI | 124024187 |
COG category | [R] General function prediction only |
COG ID | [COG2251] Predicted nuclease (RecB family) |
TIGRFAM ID | [TIGR03491] RecB family nuclease, putative, TM0106 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTGCCA CCCCTCTCGC TGCCAACGTT CTGACTGATC GCTTGCTGCG TAGTTGGCTG CGCTGTCGTC GTAGGGCTTG GCTAGATCGT TATGGCGATG GAGAGCAACG GCTCTGGACT GCTCACCGCA CTCTGCAACT TGATGATCAG CAGCGCAACT TTGTTGCTTT GTTGCCGCGC AAACCATGCA GGGGTCTAGA CGGTTGCAGT CAAGGCTGCC CAGGGGTGGT GGGGCTGAGG CTTAAGGGGG TTGGTCCTGC GGGGCAGTTA TTGGAGGCCC ATCCGCCATT GCTGCAGAGG GTCGAGGGGC AAAGTCGTTG GGGGGCATTT GCTTACCGAC CGGTGCTTGC ACGTCAGGGT CGACGCTTGA CCAGAGAGCA TCGCTTGGCT TTGGCCCTTG CTGGTCGTTT ATTGGCACCG CTGCAGTCGG CTCCCGTGCC TGAAGGCTTG GCTTTGGCCG GAGCTGGTCG CAGTCTTCAC ATGGAACGAG TTTCATTGCT GGGCGGGCTG CAGCGGCAGC TTGATGATGT CCTGGTCAAG TTGGCCGCAG ACCTCGAGCT GAGTGAGCCT CCGCCTTTGG TTGCTGATCG GCGTAAGTGC AAGTTGTGTT CCTGGCGAGG TGTTTGCAAT GCCGTGGCTT CTGTAGAGGG ACATCTCAGT GAGGTGAGCG GTATCGGGAC TCGTCGACGG CAGATGCTTC AGGAACTGGG GATCCTTGGT TTGCAGGATT TAGCGGCAGC CGATCCGAAT GAGCTCGGAA GTCGTTTGCA ACATTTCGGT GAGCAGCACG GGGAAGTGGC TTGTGAGCTT GTCGCTCAGG CCCGGGCTCA GCGGGATGGT CGTTATGAGC GATTGGACTC CGCATCAGCT TTGCCGGAAT TGGCCACTGC CCCTGGCGTG TTGTTGTACG ACATCGAATC TGATCCAGAT GCTCGCGATG ATTTTCTGCA TGGTTTTGTC CGGCTGGGCC GCAGGCCAGA TGGCAGTTGG GATTTAGAGG GCGCGCAGTA TCACCCCTTT TTGGTGCTTT ATGAGCACGG CGAGGCACGT TGCTGGCAGC GGTTGCAACG CATGCTGAAG AGTTATCCCG ACTGGCCAGT GATGCATTAC GGCGAAACGG AGTCTCTAGC TCTTCGCCGT ATGGCTAAGC GGCAGGGAGT GGACGCGGCT GAGTTGAGTG CACTGAGCAA GCGCATGATT GATGTGCACG ATCGGGTGCG GCGTTCTTGG CGATTGCCTT TAAACAGCTA TGGGTTGAAG TGCGTGGCGA GTTGGCTGGG ATTTTGTTGG CGTCAGGTGG GTGTCGATGG GGCTCGAGCT CTGCTTTGGT GGCGCCAGTG GCGTGGTTCA GGTCTTCAAG ATCGCGGCAG TTCCTATGCC CTGCGTTGGA TCTTTGATTA CAACCACGAT GATTGTCTCG CCACTTGGGC CGTGGCGGCA TGGCTGTTAA AGCAAGACGA CCTGTTAAAG CAAGACGACC TGTTAAAGCA AGACGACCTG TTAAAGCAAG ACGACCTGTT AAAGCAAGAC GACCTGTTAA AGCAAGACGA CCTGTTAAAG CAAGACGACC TGTTAAAGCA AGACGACCTG TTAAAGCAAG ACGAGTAG
|
Protein sequence | MGATPLAANV LTDRLLRSWL RCRRRAWLDR YGDGEQRLWT AHRTLQLDDQ QRNFVALLPR KPCRGLDGCS QGCPGVVGLR LKGVGPAGQL LEAHPPLLQR VEGQSRWGAF AYRPVLARQG RRLTREHRLA LALAGRLLAP LQSAPVPEGL ALAGAGRSLH MERVSLLGGL QRQLDDVLVK LAADLELSEP PPLVADRRKC KLCSWRGVCN AVASVEGHLS EVSGIGTRRR QMLQELGILG LQDLAAADPN ELGSRLQHFG EQHGEVACEL VAQARAQRDG RYERLDSASA LPELATAPGV LLYDIESDPD ARDDFLHGFV RLGRRPDGSW DLEGAQYHPF LVLYEHGEAR CWQRLQRMLK SYPDWPVMHY GETESLALRR MAKRQGVDAA ELSALSKRMI DVHDRVRRSW RLPLNSYGLK CVASWLGFCW RQVGVDGARA LLWWRQWRGS GLQDRGSSYA LRWIFDYNHD DCLATWAVAA WLLKQDDLLK QDDLLKQDDL LKQDDLLKQD DLLKQDDLLK QDDLLKQDDL LKQDE
|
| |