Gene NATL1_00041 details


Gene Information

Locus tag: NATL1_00041
Symbol:
ID: 4780497
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 6051
End bp: 8534
Gene Length: 2484 bp
Protein Length: 827 aa
Translation table: 11
GC content: 31%
IMG OID: 640083267
Product: DNA gyrase/topoisomerase IV, subunit A
Protein accession: YP_001013833
Protein GI: 124024717
COG category: [L] Replication, recombination and repair
COG ID: [COG0188] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit
TIGRFAM ID:
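
The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 6051
end_bp = 8534
gene_length_bp = 2484
protein_length_aa = 827

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is read in 3 bp codons; the last codon is assumed to be a stop
# codon, which is not counted in the protein length.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, codons, remainder, expected_protein_aa)
# prints: 2484 828 0 827
```

Under these assumptions the span matches the stated gene length (2484 bp), and 828 codons minus one stop codon gives the stated protein length (827 aa).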


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.885734
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAAGG AACGCCTGAA ACCTATATCG TTGCATCAGG AAATGCAGCG TTCTTATCTC 
GAGTACGCAA TGAGCGTAAT CGTTGGGAGA GCATTGCCAG ACGCGAGAGA TGGATTGAAG
CCGGTGCAAA GAAGAATTCT TTTTGCAATG CACGAGTTAG GACTTACGCC CGAAAGGCCT
TACAGAAAAT GTGCTCGTGT CGTAGGAGAT GTTCTTGGCA AATATCATCC ACATGGAGAT
CAAGCTGTCT ACGACGCACT TGTAAGACAA GTTCAAATAT TTAATTCAAG ATACCCAATT
CTTGATGGTC ACGGAAACTT TGGATCTATT GATGATGATC CTCCTGCTGC AATGAGATAC
ACAGAGACTA GGCTGGCGCC AATATCTAAC GATGCAATTT TAAGTGAAAT AGATAAAGAA
ACTGTTGATT TTTCACCAAA TTTTGATGGA TCACAACAAG AACCAGACGT TTTACCAGCT
CAATTACCTT TCCTCATCTT AAATGGAAGT ACAGGAATTG CCGTTGGAAT GGCAACAAGC
ATTCCTCCAC ACAATTTGAA CGAAGTGGTA GAGGCCTTAA TAGCAATAAT AAAAAAACCT
TCACTAAACG AAGAAAAATT ATTAGAAATA ATACCTGGAC CAGACTTTCC AACAGGAGGA
GAGATATTAG TTAGTAACGG AATAAAAGAA ACTTACACAA AAGGGAGAGG AAGCATAACC
ATGAGAGGAA TAGCTCGCAT TGAAGAAATT AATCCTGGAA AAGGTAAACA CAAAAGAAGT
GGGATTATTA TTTCAGAACT ACCTTATCAA TTAAACAAAG CTGGCTGGAT TGAAAAATTA
GCTGACCTTG TAAATAATGG AAAAATTTCG GGGATAGCTG ATATTAGAGA TGAAAGCGAT
CGAGACGGAA TGCGAATTCT TGTTGAAGTT AAAAGAGATT CTGATCCAAA AAAAATTCTG
GATTTTTTAT ATCAAAAAAC TTCTTTACAA AGCAATTTTG GTGCAATTTT ACTTGCATTA
GTTAATGGCC AGCCAGTACA ACTTACGTTA AGAAAATTAT TAAATAATTT CTTAGAGTTT
AGAGAAAATA CTATTTTACT AAGAAGTAAT TATTTACTAA AAAATATAAA AAATAGAGAA
GAGATAGTTG AGGCTCTAAT CCAAGCAACA AATAATGTTA GAAAAGTAAT TGAATTAATT
GAAAATTCTA AAGATACACC TGAAGCAAAA AGTAATTTAA TTATCAGCTT AAAGATTAAT
GAAAGGCAAG CAGATGGGAT TCTAGGTATG CCTTTAAAGA AAATTACAGG TCTTGAAAAA
GATTCTCTCA AGAATGAACT TAAAGATTTA AAAACTAAAA GAGCAGAACT CGAATTAATA
ATTAACGACA AAGAAAATTT AATGAAAGTT ATGGTTAAAG AACTCAAAGA TCTTAAAAAA
AGATTTGGTA GTAAAAGAAG AACAAAATTA ATAGAAGGTG GAGATGCTCT TATTGCTGAA
AAGATGGCAA ATCAAAGACC AAATAAAGAA CTTCAAAGAA TAAATGCATT AAAAGAATTA
TCAAAAGATT CTGAAATAAT TATTCAATCA AATAATGAAA TAAAGATAAT TCCCTCACTA
ACAATAAAAA AATTAAAACT GAAAGAAAGT GATCAAGAAA GAAAAGATAT TCTACCTGCA
AAGCTAATAT GGCCAATTAA AAATGAACCA AAGATATTAG CCGTCAGTCA AGAAGGTAAG
ATAGGTCTTC TTAAGTGGGA ATTTGCGGGA CAAAAACCTG GGCCTTTAAA ACAATTTTTA
CCAGCAGGTT TAGAGAATGA TAAAATAATT AATCTTATTC CACTAACAGA AATAAGAGAT
ATAAGTATAG GATTAATAAG TACTGATGGA AAGTTTAAAA GAATTTCAAT TAATGAGATT
AGCGATATTT CTAATAGATC AACAACGATT TTAAAATTAA AAGACAGTAT AAAGCTTAAA
TCTTGCATTC TTTGTAAAGA GAATAGCTAT TTGTATATAG TGAGTGATAT TGGCAGAATT
ATTAAAATTA AAATAACAGA AAATGATTTC CCTTTTATGG GCAAGTTAGC TCAAGGAACA
AATATAATAA AATTATTCCC TAATGAAAAT ATAGTGGAAG CTTTAAGTTT TCAAGAAAAG
AAAAATAAAG ATTTAATTTT AATAACTAAC AAAGGTTCTT TTGTAAAACA TTCTACAAAA
GAAATAACTA TATCCAAAAA AGGAGCATTA GGAATAATGG GAATACATTT TAAAGATAAT
AAAACAATTA AAGAAAGAGT AATTGATTGT TTTATAAATA ATAAACACGT TTTTATTAAA
ACTGATAAGG ATAGATATCA AAGATTAAAA ACCGATCAAA TTGATAATAG TTCATATAGA
AAAGAGAACA AATTAAATAT AGAATTAAAC AATGATGAGT TTTTAAAATC TACTTTTTCA
ATGAAAGTAC CAGACAAAAA TTAA
 
Protein sequence
MAKERLKPIS LHQEMQRSYL EYAMSVIVGR ALPDARDGLK PVQRRILFAM HELGLTPERP 
YRKCARVVGD VLGKYHPHGD QAVYDALVRQ VQIFNSRYPI LDGHGNFGSI DDDPPAAMRY
TETRLAPISN DAILSEIDKE TVDFSPNFDG SQQEPDVLPA QLPFLILNGS TGIAVGMATS
IPPHNLNEVV EALIAIIKKP SLNEEKLLEI IPGPDFPTGG EILVSNGIKE TYTKGRGSIT
MRGIARIEEI NPGKGKHKRS GIIISELPYQ LNKAGWIEKL ADLVNNGKIS GIADIRDESD
RDGMRILVEV KRDSDPKKIL DFLYQKTSLQ SNFGAILLAL VNGQPVQLTL RKLLNNFLEF
RENTILLRSN YLLKNIKNRE EIVEALIQAT NNVRKVIELI ENSKDTPEAK SNLIISLKIN
ERQADGILGM PLKKITGLEK DSLKNELKDL KTKRAELELI INDKENLMKV MVKELKDLKK
RFGSKRRTKL IEGGDALIAE KMANQRPNKE LQRINALKEL SKDSEIIIQS NNEIKIIPSL
TIKKLKLKES DQERKDILPA KLIWPIKNEP KILAVSQEGK IGLLKWEFAG QKPGPLKQFL
PAGLENDKII NLIPLTEIRD ISIGLISTDG KFKRISINEI SDISNRSTTI LKLKDSIKLK
SCILCKENSY LYIVSDIGRI IKIKITENDF PFMGKLAQGT NIIKLFPNEN IVEALSFQEK
KNKDLILITN KGSFVKHSTK EITISKKGAL GIMGIHFKDN KTIKERVIDC FINNKHVFIK
TDKDRYQRLK TDQIDNSSYR KENKLNIELN NDEFLKSTFS MKVPDKN
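
The gene and protein sequences above are printed in space-separated blocks of ten, so they need cleaning before any analysis. The following is a minimal sketch, not the tool that produced this record: it strips the whitespace, translates the first 30 nucleotides of the gene sequence, and computes a GC fraction. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon table is the standard amino-acid assignment, which NCBI translation table 11 shares (table 11 differs only in the start codons it permits).

```python
from itertools import product

# Standard codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()

def translate(dna: str) -> str:
    """Translate complete codons in frame 0; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)

# First three blocks of the gene sequence as printed above.
fragment = "ATGGCCAAGG AACGCCTGAA ACCTATATCG"
print(translate(fragment))
# prints: MAKERLKPIS
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` would end in `*` for the terminal stop codon, and `gc_fraction` on the full gene is the quantity to compare with the 31% GC content in the table; a short fragment such as the one above is not representative of that figure.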