Gene Information |
Locus tag | NATL1_03571 |
Symbol | |
ID | 4779283 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 329148 |
End bp | 330599 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 640083625 |
Product | RecB family nuclease |
Protein accession | YP_001014186 |
Protein GI | 124025070 |
COG category | [R] General function prediction only |
COG ID | [COG2251] Predicted nuclease (RecB family) |
TIGRFAM ID | [TIGR03491] RecB family nuclease, putative, TM0106 family |
|
|
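The coordinate, gene length, and protein length fields above are linked by simple arithmetic. The following is a minimal sketch of that consistency check. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon; neither is stated in the record. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(329148, 330599)
print(length_bp)                  # 1452
print(protein_length(length_bp))  # 483
```

Under these assumptions the computed values agree with the Gene Length (1452 bp) and Protein Length (483 aa) fields.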
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.551533 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATGTA ATAAATCCAA ACCAATTAAC GATCACCTTT TAAGGAGTTG GATTAGATGC AGAAGAAAGG CCTGGCTTGA TATTTATGGT GATAAACAAA AAAAGTTGTG GACGGCCCAC AGCACACTTC AATTAAATCA TCAAATCGAC TGCTTTCATA ATCTCTCACA AAAAAGTTAT GGGATAGGAA TTCAAGCATG TGAAGAGGGA AAAAATATTG CTTATGGAGT CAGAATTAAA TACCCTCTAA TAAAGAATAG AATTATCAAA GCAAATTTAC CAATACTAAG GAAGACATCT GGAGAGAGTA TTTGGGGTAA TTATGCATAT CAACCTATAT TAGCTAGGCA AGGTAAGAAA ATTACAAGAG AGCATAAATT AACACTGGCA ATGACTAGTT TATTAGTAAA TAATTTACAA AAATTTCAAG TACAAAAAGG GTTAATATTA CACAAAGAGA ATAAAGTTCT CAAAGTCGAA AAGATAAAAT TAAGCGATAA TATAAACACA GATTTAATAG GTTCATTATT AAACCTAGAG AAAGATATTG AATCAAGAAA TCCTCCACCC ATAACTTCTA ATAGAAAAAA ATGCACAATA TGTTCATGGA GAAAAGATTG TGATGCTGTA GCAATTAAAG AAGGTCGCTT AAGTGAAGTT AGCGGGATTG GAGAAAAAAG AGAACTTTTA TTAAACAAAA TTGGCATAAA TAATATAGAG GAACTAGCAA AGATTAAGCA TTATAAATTA AAGGAAAAGC TTGAAGTATT TGGAACACAA CATGGTGATA TTTCCAAACA AATTATTTTA CAATCTCAAT CTCGATCAAC GAATAGGGCA ATCAAATTAA ATCCAGAAAT AGAACTAAAT AATCTAAAGA AAGCAAAAGG TTTATATATT TATGATATTG AATCTGACCC AGATATTAAA CATGACTTTT TACATGGATT TATTCGATTA CCAAAAAATA TAAAAAATGA AATAAGTTTA GAACAAACTA AATATTCACC ATTACTTAAT CTTGAAAAAA ACACTGAAAG TTTTCTATGG AAAAGAATAA CTAAAAAATT AAGTATTAAT CCAGACTATC CCATCATCCA TTATGGGGAA ACAGAACCAA TATCTTTGCT AAAGCTAGGA TTGCGACAAG GAGCTAATCC TCATGAAATT GAAGAATTAA AAAAAAGATT TATTGATATA CATTTATTAA TTAGAGAATA CTGGTGTCTT CCCGTAAGAA ATTATGGTCT TAAATCAATC GCAGAATGGA TAGGATTTGA ATGGAAACAA TCAAATGCAG ATGGAGCTAG AGCTCTCTTA TGGTGGAGAC AATGGAAAAA ATCACGTAAA ATAAATAAAA TGTATTCTAG AAATTTAAAT TCAATTTTTG AATATAACCG TGATGATTGT ATAGCCACCT TAATGATCGC AAAGTGGTTA ATAGATCACT AG
|
Protein sequence | MKCNKSKPIN DHLLRSWIRC RRKAWLDIYG DKQKKLWTAH STLQLNHQID CFHNLSQKSY GIGIQACEEG KNIAYGVRIK YPLIKNRIIK ANLPILRKTS GESIWGNYAY QPILARQGKK ITREHKLTLA MTSLLVNNLQ KFQVQKGLIL HKENKVLKVE KIKLSDNINT DLIGSLLNLE KDIESRNPPP ITSNRKKCTI CSWRKDCDAV AIKEGRLSEV SGIGEKRELL LNKIGINNIE ELAKIKHYKL KEKLEVFGTQ HGDISKQIIL QSQSRSTNRA IKLNPEIELN NLKKAKGLYI YDIESDPDIK HDFLHGFIRL PKNIKNEISL EQTKYSPLLN LEKNTESFLW KRITKKLSIN PDYPIIHYGE TEPISLLKLG LRQGANPHEI EELKKRFIDI HLLIREYWCL PVRNYGLKSI AEWIGFEWKQ SNADGARALL WWRQWKKSRK INKMYSRNLN SIFEYNRDDC IATLMIAKWL IDH
|
| |
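The sequence fields above are displayed in space-separated blocks of ten characters. The following is a hedged sketch of how such a field could be cleaned up, checked for GC content, and translated. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAG) and uses the standard codon assignments, which translation table 11 shares with the standard code. The helper names are hypothetical, and only the first three blocks of the gene sequence are used to keep the example short.

```python
from itertools import product

# Standard codon assignments, in TCAG order; table 11 uses the same
# codon-to-amino-acid mapping (it differs in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(field: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(field.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
fragment = clean_sequence("ATGAAATGTA ATAAATCCAA ACCAATTAAC")
print(translate(fragment))  # MKCNKSKPIN
```

The translated fragment matches the first block of the protein sequence field. Passing the full gene sequence field to `gc_fraction` would allow a comparison against the reported GC content.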