Gene Acid345_4634 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4634 
Symbol 
ID4070791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5493171 
End bp5495195 
Gene Length2025 bp 
Protein Length674 aa 
Translation table11 
GC content60% 
IMG OID637986674 
Producthelicase c2 
Protein accessionYP_593708 
Protein GI94971660 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1199] Rad3-related DNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCAGTA GCTCACAATC CGCGGCAAGT CCGGCTCGCG ACATCCGTTC GCTCACCCTG 
TCGCAGTTCT TCGGACCGGG CGGTGTGCTG GCGAAATCGC ATCCGGCATA TGAGTTCCGG
CGCGGGCAGT TGCAGATGGC GCAAGCCGTG GAACAGGCGC TCACCGAGAA GCGGCACCTG
ATCGTTGAGG CCGGGACAGG AACCGGCAAG ACGTTGGCGT ATCTCGTGCC AGTGATTCGC
TCGGGACTGC GCGTGATCAT TTCGACGGGC ACGAAAAATC TCCAGGAGCA GCTCTTCTAT
AAAGATGTTC CCTTCCTCGA GCGCGCGATT TTCGGCGCGG AGTCAGACCA GAAGTTGAAG
GTCTGTTACA TGAAGGGGCG CAACAATTAT CTGTGCCGCA AGAAGCTGTA TGACCTAAGC
GACCGGCCGG TGCTCAACGG CCTGGAAGAG ATTGACCAGT TCCGGCAGAT CCGCGAGTGG
GAGGCCGTGA CCACGACCGG CGACCGCGCG GAGTTGACCG CCGTTCCCGA GGCCAGCCTG
CTCTGGCCAA AGCTCGACGC GCGCGCGGAC GCGTGTGTCG GCTCGAAGTG CAAAGATTTC
GAGCGCTGCT TCATCACCGA GATGCGTCGT ACGGCAATGG AGAGCGACAT CATCATCGTC
AATCACCATC TGTACTTCGC CGATCTCGCG ATTAAGCAGG CGGCGGATGG CGCGCCGGAT
GTCGGCATTC TGCCGGAAGC CGCGGCAGTA ATCTTCGACG AGGCGCACGA ACTCGAAGAC
GTCGCGGGCA GCTACTTCGG GGTGAGCGCG TCGAACGTGC GCGTGGACGA CCTGGTGCGC
GATGTTGAAT TCGCAATCAA GGAATTTCAT ATCGCGTCTC CGACGCTGTT GCAGGCTTCG
CAGCGAGCCC GCGAACGCTC GCAGTTGTTC TTCTCGCTGG TTCCACAGGG CGAAGGGCGA
TTCGCGTTCG AGAACCGTAA GGAATTTCTG GAAGAGAACG GGGAAGAGTT TCTCGCGCTG
CAGAATGCGC TGGGGCATCT CTACAGTGAA ATTCAGGCGT TAAAAGAGAA ACCGGATGAG
TTGTTCAACC TGGCTCGACG CACCGAAGAA CTGCGAGTAC AGCTGAGTTT CCTGCTGGAA
TCGAATGACA AGAACACTGT GTATTGGGTG GAGCGTCGAG GCGAGTCGCG CCGCGCTGGA
AGGCAGGGAC ACAACGTGTT TTTGCAGGCG ACGCCGATTG ATGTGTCGGC AATCCTGCAG
CGCACCTTGT TTGCAAATTT GAACACCGCG GTGTTGACCT CGGCGACGAT CGCCGTCGGC
GGCGGCTTTC AATACGCCCG CGGACGATTA GGGCTACAGG ATGCACGCGA ACTCGTCGTG
CCCTCGCATT TCGACTACGA GACGCAAGCG GTGTTCTACG TGCCGCCAGA TATGCCTGAT
CCGCGCGAGC AATTGTTTTC GCGCAAGGCG GCCGATCGCA TCCGCCAACT GCTGGAGATC
ACGCGTGGGC GCGCCTTCTG CCTGTTCACC AGCTACGCGC AGATGAACGA CGTCTACGAC
CGGCTGCTCG GGGAACTCGA TTATCCGATG CTGAAGCAGG GCGATGCACC GAAATCAGCA
CTGCTCGAAG AGTTCCGCAC TACGCCCGGT GCGGTGCTGT TCGCGACCAG TTCTTTCTGG
CAGGGCGTGG ATGTGCAGGG CGAACAGCTG AGCTGCGTCA TCATCGACCG GCTGCCGTTC
GCGGTGCCGA ACGATCCCGT GGTTGCCGCG CGGATCCGCG CCATTGATGC CGATGGCGGC
AATGCGTTCT TCGACTACCA GGTGCCGAGC GCGGTCATCG CGTTGAAGCA GGGCTTCGGC
AGGCTGATCC GCTCCCTGCA CGATCGCGGC GTGGTGTGTC TACTCGATAA TCGAATTCTG
AAGAAGCAAT ACGGACGTGT GTTTGTGGAG AGCCTGCCAA AATACTGGCG TACGACCGAC
ATTCGTAAGG TCGAGGAGTT TTTCGCAGGG CCCGAAGCGG ATTAG
 
Protein sequence
MASSSQSAAS PARDIRSLTL SQFFGPGGVL AKSHPAYEFR RGQLQMAQAV EQALTEKRHL 
IVEAGTGTGK TLAYLVPVIR SGLRVIISTG TKNLQEQLFY KDVPFLERAI FGAESDQKLK
VCYMKGRNNY LCRKKLYDLS DRPVLNGLEE IDQFRQIREW EAVTTTGDRA ELTAVPEASL
LWPKLDARAD ACVGSKCKDF ERCFITEMRR TAMESDIIIV NHHLYFADLA IKQAADGAPD
VGILPEAAAV IFDEAHELED VAGSYFGVSA SNVRVDDLVR DVEFAIKEFH IASPTLLQAS
QRARERSQLF FSLVPQGEGR FAFENRKEFL EENGEEFLAL QNALGHLYSE IQALKEKPDE
LFNLARRTEE LRVQLSFLLE SNDKNTVYWV ERRGESRRAG RQGHNVFLQA TPIDVSAILQ
RTLFANLNTA VLTSATIAVG GGFQYARGRL GLQDARELVV PSHFDYETQA VFYVPPDMPD
PREQLFSRKA ADRIRQLLEI TRGRAFCLFT SYAQMNDVYD RLLGELDYPM LKQGDAPKSA
LLEEFRTTPG AVLFATSSFW QGVDVQGEQL SCVIIDRLPF AVPNDPVVAA RIRAIDADGG
NAFFDYQVPS AVIALKQGFG RLIRSLHDRG VVCLLDNRIL KKQYGRVFVE SLPKYWRTTD
IRKVEEFFAG PEAD