Gene NATL1_03571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_03571 
Symbol 
ID4779283 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp329148 
End bp330599 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content30% 
IMG OID640083625 
ProductRecB family nuclease 
Protein accessionYP_001014186 
Protein GI124025070 
COG category[R] General function prediction only 
COG ID[COG2251] Predicted nuclease (RecB family) 
TIGRFAM ID[TIGR03491] RecB family nuclease, putative, TM0106 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.551533 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATGTA ATAAATCCAA ACCAATTAAC GATCACCTTT TAAGGAGTTG GATTAGATGC 
AGAAGAAAGG CCTGGCTTGA TATTTATGGT GATAAACAAA AAAAGTTGTG GACGGCCCAC
AGCACACTTC AATTAAATCA TCAAATCGAC TGCTTTCATA ATCTCTCACA AAAAAGTTAT
GGGATAGGAA TTCAAGCATG TGAAGAGGGA AAAAATATTG CTTATGGAGT CAGAATTAAA
TACCCTCTAA TAAAGAATAG AATTATCAAA GCAAATTTAC CAATACTAAG GAAGACATCT
GGAGAGAGTA TTTGGGGTAA TTATGCATAT CAACCTATAT TAGCTAGGCA AGGTAAGAAA
ATTACAAGAG AGCATAAATT AACACTGGCA ATGACTAGTT TATTAGTAAA TAATTTACAA
AAATTTCAAG TACAAAAAGG GTTAATATTA CACAAAGAGA ATAAAGTTCT CAAAGTCGAA
AAGATAAAAT TAAGCGATAA TATAAACACA GATTTAATAG GTTCATTATT AAACCTAGAG
AAAGATATTG AATCAAGAAA TCCTCCACCC ATAACTTCTA ATAGAAAAAA ATGCACAATA
TGTTCATGGA GAAAAGATTG TGATGCTGTA GCAATTAAAG AAGGTCGCTT AAGTGAAGTT
AGCGGGATTG GAGAAAAAAG AGAACTTTTA TTAAACAAAA TTGGCATAAA TAATATAGAG
GAACTAGCAA AGATTAAGCA TTATAAATTA AAGGAAAAGC TTGAAGTATT TGGAACACAA
CATGGTGATA TTTCCAAACA AATTATTTTA CAATCTCAAT CTCGATCAAC GAATAGGGCA
ATCAAATTAA ATCCAGAAAT AGAACTAAAT AATCTAAAGA AAGCAAAAGG TTTATATATT
TATGATATTG AATCTGACCC AGATATTAAA CATGACTTTT TACATGGATT TATTCGATTA
CCAAAAAATA TAAAAAATGA AATAAGTTTA GAACAAACTA AATATTCACC ATTACTTAAT
CTTGAAAAAA ACACTGAAAG TTTTCTATGG AAAAGAATAA CTAAAAAATT AAGTATTAAT
CCAGACTATC CCATCATCCA TTATGGGGAA ACAGAACCAA TATCTTTGCT AAAGCTAGGA
TTGCGACAAG GAGCTAATCC TCATGAAATT GAAGAATTAA AAAAAAGATT TATTGATATA
CATTTATTAA TTAGAGAATA CTGGTGTCTT CCCGTAAGAA ATTATGGTCT TAAATCAATC
GCAGAATGGA TAGGATTTGA ATGGAAACAA TCAAATGCAG ATGGAGCTAG AGCTCTCTTA
TGGTGGAGAC AATGGAAAAA ATCACGTAAA ATAAATAAAA TGTATTCTAG AAATTTAAAT
TCAATTTTTG AATATAACCG TGATGATTGT ATAGCCACCT TAATGATCGC AAAGTGGTTA
ATAGATCACT AG
 
Protein sequence
MKCNKSKPIN DHLLRSWIRC RRKAWLDIYG DKQKKLWTAH STLQLNHQID CFHNLSQKSY 
GIGIQACEEG KNIAYGVRIK YPLIKNRIIK ANLPILRKTS GESIWGNYAY QPILARQGKK
ITREHKLTLA MTSLLVNNLQ KFQVQKGLIL HKENKVLKVE KIKLSDNINT DLIGSLLNLE
KDIESRNPPP ITSNRKKCTI CSWRKDCDAV AIKEGRLSEV SGIGEKRELL LNKIGINNIE
ELAKIKHYKL KEKLEVFGTQ HGDISKQIIL QSQSRSTNRA IKLNPEIELN NLKKAKGLYI
YDIESDPDIK HDFLHGFIRL PKNIKNEISL EQTKYSPLLN LEKNTESFLW KRITKKLSIN
PDYPIIHYGE TEPISLLKLG LRQGANPHEI EELKKRFIDI HLLIREYWCL PVRNYGLKSI
AEWIGFEWKQ SNADGARALL WWRQWKKSRK INKMYSRNLN SIFEYNRDDC IATLMIAKWL
IDH