Gene Cmaq_0109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_0109 
Symbol 
ID5709461 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp133817 
End bp135649 
Gene Length1833 bp 
Protein Length610 aa 
Translation table11 
GC content46% 
IMG OID641274615 
ProductDNA topoisomerase type IA central domain-containing protein 
Protein accessionYP_001539953 
Protein GI159040701 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01057] DNA topoisomerase I, archaeal 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000681889 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAAGT TAATTGTTGC TGAAAAGAAC AGTGTGGCTA AGGCTATAGC CCAATACTTA 
GCTGAGGGTG GATACACATT AAGGAGAATT GGTATTGTAC CCGTCTACTT CTTTAAGGTT
AATGGGGAGT ATTGGGCATC CATGGGCCTA AGGGGGCATA TCCTTGACTT CGACTTTGAA
CACTCCTATA ATAATTGGAA CAGAGTGGAG CCGGGTAAGC TCCTTGACCT TGAGCCAGTA
ATGGTGATTA GGGGTTGGGA TAGGCCGTAC GTAACGGCGT TGGTTGAATT ATCGAAGCAG
GCTAGGGAAA TTATCCTCGC CCTAGACTCT GATGTTGAGG GTGAGGCAAT AGCCTACGAG
GTAATGCTTG TGACTAGGCT TAGGAAACCC ACCTTAAGGT TTAGGAGGGC ATTATTCTCA
GCGGTCACTA GGGATGATAT TAGGAGGGCA TTCAGTAAGT TAACAACAAT CAACGTTAAC
CTTGCTAGGA AGGTCTTCAC CAGAATGGTT ATTGACCTTA AGTACGGTGC AACATTCACT
AGGCTATTAA CCTTAAGCGC CAAGTCAAGT AAGGCGCCAT TAAATAGGGG TGAGTTCCTA
AGCTACGGCC CCTGTCAAAC ACCGGTGCTT AACCTAGTTG TTCAAAGAGC CTTGGAGAGG
GAGAATTTTA AGCCTGAGGT TTACTATAAG ATTAAATTAA TCATTGAGGC TAACGGTGAG
TTAATTGAAT TAGAGTCCAT TGATAAGTTT AAGAACCTTA AGGAGGCCCA GGAGGCGCTT
AGTAGGGTTA AGGCAGGTAA GGCGGTGGTT AAGAATATTG AGGCTAAGAG GGTTCACGTT
AATCCACCTA AGCCCCTTGA AACCGTGGAG CTTGAGAGGA GGGCTAGTAG ATTCCTCAAC
ATTAGGAGTA AGCAGACCCT TGATGCAGCA GAGGAGCTTT ACAGGCAGGG TTACATATCC
TACCCAAGAA CTGAAACCAA CATTTACCCA CCCACCCTGG ATTTAAGGGG TATTTTAAGG
AACCTAACCT CAACGTCAAC CTACGGTCAA TACGCAAGGC ATCTACTGGC TGGTGAATTA
AGGCCTACTG CAGGTAAGGA TAATGATAAC GCCCACCCGC CAATACACCC GGTTAAGGCT
GCTGATAAGC CTGAGTTAAT GGCTAGGTTC AGGGACTTTA AGTACTGGCT CATATACGAC
CTGGTGGTTA GGCATTTCCT AGCAACCCTA AGTCCCCCAG CCTTAATTGA GGAGCAGAGG
CTTACTGTTG ATGCTGGTGG AGTACTCTTC GAGGCTTCAG GCCTCAGGAT CATTAATGAC
GGCTACTTCA CGATTTACCC CTTCGAGAGA CCTAGGGCTA ATCCACTACC CTTAAGCGCA
TTAAGGATTG GTATGCAGGT CACTGTTAAG GATGCTAAGG TTGTTAAGAG GAAGACCACG
CCGCCACCCT ACTTAAGTGA ATCAGAGTTG CTTAGGTTAA TGAGGAAGTA CGGTATAGGT
ACAGACGCCA CTATGCAGGA CCACATACAT ACTAACGTTA AGAGAAGGTA CTTCAAGATA
ATTAAGGGGC AGTGTGTACC CACACCGCTT GGTAAAGCGT TAATAACCTC ATTATCCAAG
TACGCACCAA CGTTAATAGA CCCAAACTTC AGGAGTAGGA TGGAGTCCAT GCTTTCACTC
ATTGGTTCAG GGAAGGAGAT GCCTGACTCA GTGAGGAGGA GGCTTGAGGA GGAGGCCGCT
AGGGTTTACA CCTCAATGAA GCCTAATTCT AATCAACTCG GGGAAGAATT AGCTAAGGCG
TTGAGAAGCA TGGTTAATGA AAAGGGCGCT TAA
 
Protein sequence
MDKLIVAEKN SVAKAIAQYL AEGGYTLRRI GIVPVYFFKV NGEYWASMGL RGHILDFDFE 
HSYNNWNRVE PGKLLDLEPV MVIRGWDRPY VTALVELSKQ AREIILALDS DVEGEAIAYE
VMLVTRLRKP TLRFRRALFS AVTRDDIRRA FSKLTTINVN LARKVFTRMV IDLKYGATFT
RLLTLSAKSS KAPLNRGEFL SYGPCQTPVL NLVVQRALER ENFKPEVYYK IKLIIEANGE
LIELESIDKF KNLKEAQEAL SRVKAGKAVV KNIEAKRVHV NPPKPLETVE LERRASRFLN
IRSKQTLDAA EELYRQGYIS YPRTETNIYP PTLDLRGILR NLTSTSTYGQ YARHLLAGEL
RPTAGKDNDN AHPPIHPVKA ADKPELMARF RDFKYWLIYD LVVRHFLATL SPPALIEEQR
LTVDAGGVLF EASGLRIIND GYFTIYPFER PRANPLPLSA LRIGMQVTVK DAKVVKRKTT
PPPYLSESEL LRLMRKYGIG TDATMQDHIH TNVKRRYFKI IKGQCVPTPL GKALITSLSK
YAPTLIDPNF RSRMESMLSL IGSGKEMPDS VRRRLEEEAA RVYTSMKPNS NQLGEELAKA
LRSMVNEKGA