Gene NATL1_01421 details


Gene Information

Locus tag: NATL1_01421
Symbol:
ID: 4779238
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 139659
End bp: 141863
Gene Length: 2205 bp
Protein Length: 734 aa
Translation table: 11
GC content: 36%
IMG OID: 640083406
Product: recombination factor protein RarA/unknown domain fusion protein
Protein accession: YP_001013971
Protein GI: 124024855
COG category: [L] Replication, recombination and repair
COG ID: [COG2256] ATPase related to the helicase subunit of the Holliday junction resolvase
TIGRFAM ID:
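
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are inclusive and that the CDS ends in a single stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 139659
end_bp = 141863
protein_length_aa = 734

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one codon per residue plus one terminal stop codon.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(gene_length_bp)           # 2205
print(gene_length_bp % 3)       # 0
print(expected_protein_length)  # 734
```

Under these assumptions the reported gene length (2205 bp) and protein length (734 aa) agree with the start and end coordinates.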


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.869861
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTCCAAAG ATCTGTTTGC TTTTAATGGT GAACAGCTAA TTCAGAATAA TGCTCCTTTA 
GCTGATCGCT TACGGCCTCA AACACTGGAT GAATTTGTTG GTCAAGATCA CATTCTTGCT
CAAGGACGTT TATTGAGACG TTCAATTGTT GCTGACAAAG TAGGCAATTT ATTGCTTTAT
GGACCCCCTG GAGTTGGTAA GACTACTTTG GCTAGGATTA TCGCCTTAAA TACCCTATCT
CACTTTAGTG TCGTAAATGC AGCACTGGCT GGCATCAAGG ATTTGAGATC TGAGATAGAG
TCGGCAATCG ATAGATTAAA TAAATTTGGT AAACGCACAA TTTTATTTAT TGATGAGGTT
CATAGATTTA ATACTGCTCA ACAAGATGCC TTATTACCTT GGGTTGAAAA TGGAACTTTG
ACCCTTATTG GGGCCACAAC GGAAAATCCA TATTTTGAAG TAAATAAAGC GTTGCTAAGT
AGATCTAGAC TATTTCGTTT AAATAGTCTG AATTCAAAAG CATTACATCA ATTGCTGCAA
CGAGCTTTGA ATGATAAGAA GAGGGGATAT GGATTGAAAT TAATCAATTT AGCTAGTGAA
GCTGAGGATC ATTTGGTTGA TGTGTGTAAT GGCGATGCAC GCGTTCTGCT TAATGCGCTT
GAACTTGCTG TAGAGAGCAC TATTGCAAAT CAAGATACTT CAATCAATAT TGATCTCAAG
ATTGCTGAGG ATTCAATTCA AGAACGAGCG GTTTTATACG ACAAAAAAGG TGATGCTCAT
TTTGATACTA TCAGCGCTTT TATTAAGTCA TTAAGAGGCT CGGATCCTGA TGCGGCATTG
TTTTGGCTTG CTCGAATGTT GGAGGCTGGA GAAAGTCCAC GATTCATTTT TAGACGTATG
CTCATCGCAG CAGGAGAAGA TATTGGTCTT GCTGATCCCA ATGCAATTGT CATAGTTGAG
TCATGCGCTG CGGCTTTTGA TCGAATAGGT TTGCCAGAGG GTGTTTACCC ACTGGCTCAG
GCAACCTTGT ACTTGGCTTC AACTGAGAAA AGCAATAGTG TGAAGGCTAT TTTTAAGGCA
GTTCAGAAAG TTAAAGATTC CCAAAAGCAA AATGTTCCAT CTCATCTTAA AGATCCAAAT
CGAGATCAAG AATCTTTTGG AGATGGCATG GGTTACAGGT ATCCACATTC ATTTTCAAAA
CACTGGGTTC CACAGCAATA TTTGCCAGAC ACTTTGCTAA ACGAGATTTT TTGGGAGCCA
ACTGAACATG GATGGGAAGG GCAAAGACGA TCTCTTTTGA ATGAGAGAAG ATCTGAACAG
TTAGCTTCAT TAATTGAAGT TGAGCAACAA AATCCTTTAA CTATTACATC TGGCAAAGTT
GATAATGATT TGGAAAAATG GTTATCTCGC CAAGTTTTGC AGGAGGGGGA ACGATTAAAA
AATTTGATGA CTAAATTATG GTCCGGCATT ACTTGGAAAA AAAATCATAG GGTTTTAGTT
TTAGCACCTA GTTCTTTGCT TTGGTCTTTA AAGCCTTTAA GAGAGGTATC TGAAGGGGGC
GTTTTTTTGG CTGTATCAGA GGATAATCAT CCCAAGTTAT TAGCTGAATT AGAAGTTTTA
GCGCCTATGG AGCGACCTGT TTTAATTGAT TCAAAAGTTG AATCAATTAA AAAATTAGAA
GACAATCTCA AGTTTGAGGT AATTGGAGGA AGGATTCCTT GGAAAGTTTT TTCTGAAACA
AATTTCTTTG AATTGTGGCC GATTCTTACT GAAAAATGTA CGGCGAATAC AGCATTAAGT
TTGATTATAA GTAACCCATG TTCTGGCCCT GCCTTTTCTT TAAAGGAAAG ATTAGAGTTT
TATAGTAATA AGAAAAATAC TGATTTTTCA TTCTTGAGTG ATTTAATTTG TAAAGAGGAG
AAGTGGTTAA ATAAGCAAGA ACATAAAAAG AAATTTATTC TACAATTAGA AAAATTAGGC
TGGAATATTT CTTTTGAAGA ATGGACTGAG TTTGTATATC AAAAAGTTGA CAATACTATA
ATTAAAAGGT GGCTTAATCA GGGAAGTGAG TATCGAGAAA TTATTCTCAA AAATTGTGAG
GAAGAAACAT TAATTCGATT GCAAGAATTA TTTAAAAGAT TGGAAGGCCA GACTATAAAA
CAGAAGCTTA TACATACTAA ATTTCTTGCT AAGAATAGTA ATTAA
 
Protein sequence
MSKDLFAFNG EQLIQNNAPL ADRLRPQTLD EFVGQDHILA QGRLLRRSIV ADKVGNLLLY 
GPPGVGKTTL ARIIALNTLS HFSVVNAALA GIKDLRSEIE SAIDRLNKFG KRTILFIDEV
HRFNTAQQDA LLPWVENGTL TLIGATTENP YFEVNKALLS RSRLFRLNSL NSKALHQLLQ
RALNDKKRGY GLKLINLASE AEDHLVDVCN GDARVLLNAL ELAVESTIAN QDTSINIDLK
IAEDSIQERA VLYDKKGDAH FDTISAFIKS LRGSDPDAAL FWLARMLEAG ESPRFIFRRM
LIAAGEDIGL ADPNAIVIVE SCAAAFDRIG LPEGVYPLAQ ATLYLASTEK SNSVKAIFKA
VQKVKDSQKQ NVPSHLKDPN RDQESFGDGM GYRYPHSFSK HWVPQQYLPD TLLNEIFWEP
TEHGWEGQRR SLLNERRSEQ LASLIEVEQQ NPLTITSGKV DNDLEKWLSR QVLQEGERLK
NLMTKLWSGI TWKKNHRVLV LAPSSLLWSL KPLREVSEGG VFLAVSEDNH PKLLAELEVL
APMERPVLID SKVESIKKLE DNLKFEVIGG RIPWKVFSET NFFELWPILT EKCTANTALS
LIISNPCSGP AFSLKERLEF YSNKKNTDFS FLSDLICKEE KWLNKQEHKK KFILQLEKLG
WNISFEEWTE FVYQKVDNTI IKRWLNQGSE YREIILKNCE EETLIRLQEL FKRLEGQTIK
QKLIHTKFLA KNSN
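
The sequences above are printed in space-separated blocks of ten characters. Before any analysis they need to be joined into a single string. The sketch below shows one way to do that, along with a GC-fraction calculation and a length consistency check between a CDS and its protein. The helper names (normalize, gc_fraction, lengths_consistent) are hypothetical and do not come from the source database. The length check assumes the CDS includes exactly one stop codon.

```python
def normalize(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def lengths_consistent(dna: str, protein: str) -> bool:
    """True if the CDS has one codon per residue plus one stop codon (assumed)."""
    return len(dna) == 3 * (len(protein) + 1)


# First row of the gene sequence above, used only as a short demonstration.
first_row = "TTGTCCAAAG ATCTGTTTGC TTTTAATGGT GAACAGCTAA TTCAGAATAA TGCTCCTTTA"
dna = normalize(first_row)

print(len(dna))          # 60
print(gc_fraction(dna))  # GC fraction of this first row only
```

To compare against the reported 36% GC content, pass the full gene sequence (all rows) through normalize before calling gc_fraction.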