Gene Jann_3602 details


Gene Information

Locus tag: Jann_3602
Symbol:
ID: 3936078
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 3678087
End bp: 3680282
Gene Length: 2196 bp
Protein Length: 731 aa
Translation table: 11
GC content: 61%
IMG OID: 637905978
Product: excinuclease ABC subunit B
Protein accession: YP_511544
Protein GI: 89056093
COG category: [L] Replication, recombination and repair
COG ID: [COG0556] Helicase subunit of the DNA excision repair complex
TIGRFAM ID: [TIGR00631] excinuclease ABC, B subunit
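
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are made up for this example, and it assumes the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon.

```python
# Values copied from the Gene Information fields above.
start_bp = 3678087
end_bp = 3680282
gene_length_bp = 2196
protein_length_aa = 731

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon, so it has one more codon
# than the protein has residues.
codon_count = gene_length_bp // 3
residues_from_length = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("codons minus stop match protein length:", residues_from_length == protein_length_aa)
```

Under these assumptions the three fields agree with one another: the span equals the gene length, and the gene length corresponds to 731 residues plus one stop codon.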


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.510357
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.935221
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACAACA ATTCGCCCGA ACAGATGCCT GACACGGGCC GCGCCGTGGA CGCGCTGCGC 
GCACAATTGA AAATGGAGGG TGGCAAAGCC TTTGTCCTCC AGACCGAGTT TGAACCGGCA
GGCGATCAGC CCACCGCAAT CGCAGAATTG TCCGAGGGGA TTCGCGAGGG TGAACGGGAC
CAGGTGCTCC TTGGCGCAAC GGGAACTGGT AAGACCTTCA CCATGGCAAA GATGATTGAG
GAAACGCAGC GCCCCGCCAT CATTCTTGCG CCCAACAAGA CGTTGGCCGC GCAGCTTTAT
GCGGAGTTCA AGAACTTCTT CCCGGAAAAC GCCGTCGAAT ACTTCGTCAG CTACTACGAC
TATTATCAGC CCGAGGCCTA TGTGGCGCGG TCCGACACCT ACATCGAGAA AGAAAGCCAG
ATCAACGAAC AGATCGACCG CATGCGCCAC TCGGCCACGC GGGCCCTGTT GGAACGGGAC
GACGTTATCA TCGTGGCATC CGTGTCGTGT ATCTACGGCA TCGGCAGCGT GGAAACCTAC
GGTGCGATGA CCCAGGATCT GCACGCAGGC CGGGAATATG ACCAACGCAA GGTCATCGCC
GATCTGGTGG CGCAGCAATA TCGCCGCAAC GACGCCGCCT TTCAGCGGGG GTGTTTCCGC
GTGCGCGGTG ACAGTCTGGA AGTCTGGCCC GCCCACTTGG ATGATCGCGC CTGGAAGCTG
TCGTTTTTTG GGGAAGAGCT GGAAAGCATC ACCGAATTCG ACCCCCTGAC CGGTCAGAAG
ACGGACACGT TCGAGAAGAT CCGCGTCTAC GCGAATTCCC ACTACGTCAC GCCGCGCCCC
ACGATGCAGC AGGCCATGAA GGGCATTAAG TCTGAGCTGG CGATGCGCCT GAAACAGATG
ATTGACGAGG GCAAGCTGCT GGAGGCCCAG CGGCTGGAAC AACGCACCAA CTTCGATCTG
GAGATGCTGG AAGCCACCGG CGTCTGCAAC GGGATCGAGA ACTACTCCCG CTATCTGACG
GGCCGCGCGC CGGGTGAACC GCCCCCCACC CTGTTCGAAT ACATCCCCGA TAATGCGATT
GTTTTCGCCG ACGAAAGCCA TGTTTCCGTG CCGCAGATCG GCGGGATGTA CCGCGGCGAC
TACCGGCGCA AATTCACGCT GGCAGAACAC GGCTTCCGCC TGCCCTCCTG CATGGACAAC
CGCCCCCTGA AGTTTGAGGA ATGGGACGCC ATGCGGCCGC AATCCGTCTT CGTATCCGCC
ACACCCGCCG CGTGGGAGTT GGAGCAGGCG GGCGGCGTTT TCACCGAACA GGTCATTCGC
CCAACAGGCC TGATCGACCC GCAGATCGAG ATCCGACCGG TGGATATGCA GGTCGATGAT
CTGCTGGATG AGGTCCGCAA AGTGGCCGCC GATGGTTATC GCACGCTGGT CACGACGCTC
ACCAAGCGCA TGGCCGAAGA TCTGACGGAA TACATGCACG AACAGGGCAT CCGCGTGCGC
TACATGCACT CGGATATCGA CACACTGGAA CGGATCGAAA TCCTGCGCGA CCTGCGGTTG
GGCGCGTTTG ACGTGCTGAT CGGGATCAAC CTGCTGCGCG AGGGTCTCGA CATTCCGGAA
TGCGGGTTGG TGGCCATTCT GGATGCCGAC AAGGAAGGCT TTTTGCGGTC CGAGACGTCC
CTGATTCAGA CCATTGGGCG GGCCGCGCGC AACGCCGACG GTCGCGTGAT CATGTATGCG
GACAAGATCA CCGGCTCCAT GGAGCGCGCC ATGCGCGAGA CCGAGCGGAG ACGGGTCAAG
CAGTTGGCGT ATAACGAAGA ACACGGCATC ACGCCCGCAA CGATCAAGAA GAATGTCGAT
GACATCCTGA TGGGCGTCTA CCAGGGTGAC ACCGACCAAA GCCGCGTCAC GGCCAAGGTC
GACAAACCGC TTGTGGGCGC GAACCTTGCG GCGCATCTCG ACGGCCTGCG CGACAAGATG
CGCAAGGCGG CGGAGAACCT GGAGTTTGAG GAAGCTGCGC GCCTGCGTGA TGAGGTCAAG
CGGTTGGAGA CGGTGGAACT GGCCATCGCC GACGACCCGC TGGCCCGCCA GTCAGCGGTG
GAGGCGGCGG TGGAAGATGC GTCAAAGGCC AGCGGCCGGT CAACGGCAGG TCGGGGCGGT
CAACGGGGTG GGAACGTCAA GCGGCGGAAG CGGTAG
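
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any calculation such as the GC content reported in the Gene Information section. The following is a minimal sketch of that step. The helper names `parse_blocked_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def parse_blocked_sequence(text):
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Sample input: the first line of the gene sequence above.
first_line = "ATGCACAACA ATTCGCCCGA ACAGATGCCT GACACGGGCC GCGCCGTGGA CGCGCTGCGC"
seq = parse_blocked_sequence(first_line)

print(len(seq), "bases, GC percent:", round(100 * gc_fraction(seq)))
```

Passing the full block of sequence lines to `parse_blocked_sequence` works the same way, since line breaks are treated like any other whitespace.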
 
Protein sequence
MHNNSPEQMP DTGRAVDALR AQLKMEGGKA FVLQTEFEPA GDQPTAIAEL SEGIREGERD 
QVLLGATGTG KTFTMAKMIE ETQRPAIILA PNKTLAAQLY AEFKNFFPEN AVEYFVSYYD
YYQPEAYVAR SDTYIEKESQ INEQIDRMRH SATRALLERD DVIIVASVSC IYGIGSVETY
GAMTQDLHAG REYDQRKVIA DLVAQQYRRN DAAFQRGCFR VRGDSLEVWP AHLDDRAWKL
SFFGEELESI TEFDPLTGQK TDTFEKIRVY ANSHYVTPRP TMQQAMKGIK SELAMRLKQM
IDEGKLLEAQ RLEQRTNFDL EMLEATGVCN GIENYSRYLT GRAPGEPPPT LFEYIPDNAI
VFADESHVSV PQIGGMYRGD YRRKFTLAEH GFRLPSCMDN RPLKFEEWDA MRPQSVFVSA
TPAAWELEQA GGVFTEQVIR PTGLIDPQIE IRPVDMQVDD LLDEVRKVAA DGYRTLVTTL
TKRMAEDLTE YMHEQGIRVR YMHSDIDTLE RIEILRDLRL GAFDVLIGIN LLREGLDIPE
CGLVAILDAD KEGFLRSETS LIQTIGRAAR NADGRVIMYA DKITGSMERA MRETERRRVK
QLAYNEEHGI TPATIKKNVD DILMGVYQGD TDQSRVTAKV DKPLVGANLA AHLDGLRDKM
RKAAENLEFE EAARLRDEVK RLETVELAIA DDPLARQSAV EAAVEDASKA SGRSTAGRGG
QRGGNVKRRK R
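
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is a hand-rolled illustration, not the database's own method. It builds the standard codon table, which shares its codon-to-amino-acid assignments with translation table 11 (table 11 differs in which codons may act as start codons). The name `translate` is hypothetical, and the function does not handle alternative start codons, which is not an issue here because the gene starts with ATG.

```python
# Standard codon table, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(text):
    """Translate a (possibly blocked) DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        # Codons with ambiguous bases are reported as "X".
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Sample input: the first and last lines of the gene sequence above.
print(translate("ATGCACAACA ATTCGCCCGA ACAGATGCCT GACACGGGCC GCGCCGTGGA CGCGCTGCGC"))
print(translate("CAACGGGGTG GGAACGTCAA GCGGCGGAAG CGGTAG"))
```

The two sample lines translate to the first twenty and last eleven residues of the protein sequence shown above.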