Gene Acid345_3543 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3543 
Symbol 
ID4069275 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4190727 
End bp4192286 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content63% 
IMG OID637985566 
Productthiol oxidoreductase-like 
Protein accessionYP_592618 
Protein GI94970570 
COG category[C] Energy production and conversion 
COG ID[COG3488] Predicted thiol oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00122263 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGTGGAT GGCAAAGAAT CCTTCGTATC GTGGGTCTAG GACTGGCCTT ACCAAGCGGA 
CTTGCGTGGG CGCAACTTGT GGACAACACC CAGGCGACCA GCACCGCTAA GGCCGGCATT
AATAAGTCTC TCTCGCTAGA AGTTGGCGCG GGACGCGGCG ACTGGAATAC GCCAGACTCG
TCGGTCTTCA TCATCAATCG CGATCCGTTC CGGTCGGTCC GGCGCGGACG TCAGCTCTTC
CAGCGCAAAT TCACCCGGTT GGAGGGTACC GGCCCGAACG ACAGCGACGG TGTCGGCGAT
ATCGGCCTCA ATAACGCTAT CGGCGCAGGA CTCTCTGACA GTTGCGCGCT CTGCCATGGA
CGCCCGCGCG GCTCTGCCGG CGTAGGCGGC AACGTCAACA CCCGTCCCGA CAGCCGCGAC
GCTCCACACC TCTTTGGCCT CGGCCTCCGC GAGATGCTCG GCGACGAGAT CACCGGCGAC
CTGCGGATGA TCCGCACCGT GGCCCAACTC CAGGCCGAGA CCTACCATCG CCCGATCACC
AAGGCGCTGG TGAGCAAAGG CATCAGCTTC GGATCCATCA CTGCGAATCC CAACGGCTCG
TTCGATACGT CGAAGGTATC CGGTGTTGAC CCGGACCTGC GCGTGAAGCC ATTCTCCGCC
GAAGGCAGCG ATTTCTCGAT GCGTGCCTTC ATTGTCGGTG CGTTCCGGAA AGAAATGGGG
ATGCTCGATG CCTACGATCC GGACGTAGCG GCGGCCAGCG CAGGCGCCCG GGTGGTGACG
CCCTCCGGCA TGGTGCTCGA TGGCATGAAG GACACCATCA CTCCGCCTCC GTCGCCCGAT
GCCGCACACG GAAACGTAGA CCCGGCGCTG GTCGACCACC TCGAGTTCTA TCTTCTCAAC
TACTTCAAGC CTGGACACGG CGAGCAAACG TCGCAAGTCC GCGAGGGACG GCGCATCTTT
AACGACATCG GCTGCGCCCA GTGTCACGTC GCCAACCTGA CCATCAATCA CGACCGTCGC
GTCGCGGACC TGGAGACGGT GTACGACCCC ACCCGCGGCA ACTTTAACAG CTTGTTCGCG
ACCGCAACGC CGCTGATCAA AGTCACCGAC GACGGTTCCG GCCAGCCAAC GTTGAAGCAG
CCCGCGGGCG GATCGTTCGT GGTGAAGGAC CTCTTCACCG ACTTTAAGCG CCACGATCTC
GGCCCGATGT TCTACGAACG CAACTGGGAC GGCACGATGC AGAAGACGTT CATGACGCGT
CCGCTGTGGG GCGTGGGAAG CACTGCGCCG TACGGACACG ATGGGCGCAG CATAACCCTG
GACGACGTGA TCCTGCGGCA TGGTGGGGAG TCGCAACACT CGCGAAACGC GTACGCCCGC
CTGCGCCGCG AAGACTCCCA GGCGATCCAG GCCTTCCTGA ACTCGCTGGT GCTCTTCCCG
CCGGACGATA CCGCCTCAAC CCTCGATCCT GGCGATCGCA CCAACCCGAA CTTCCCGCAA
GTTGGCCACG GCAGCATTAA GCTGACGGTG CTGTTCAACG ATCCGGGAGA TCCAGAATAG
 
Protein sequence
MCGWQRILRI VGLGLALPSG LAWAQLVDNT QATSTAKAGI NKSLSLEVGA GRGDWNTPDS 
SVFIINRDPF RSVRRGRQLF QRKFTRLEGT GPNDSDGVGD IGLNNAIGAG LSDSCALCHG
RPRGSAGVGG NVNTRPDSRD APHLFGLGLR EMLGDEITGD LRMIRTVAQL QAETYHRPIT
KALVSKGISF GSITANPNGS FDTSKVSGVD PDLRVKPFSA EGSDFSMRAF IVGAFRKEMG
MLDAYDPDVA AASAGARVVT PSGMVLDGMK DTITPPPSPD AAHGNVDPAL VDHLEFYLLN
YFKPGHGEQT SQVREGRRIF NDIGCAQCHV ANLTINHDRR VADLETVYDP TRGNFNSLFA
TATPLIKVTD DGSGQPTLKQ PAGGSFVVKD LFTDFKRHDL GPMFYERNWD GTMQKTFMTR
PLWGVGSTAP YGHDGRSITL DDVILRHGGE SQHSRNAYAR LRREDSQAIQ AFLNSLVLFP
PDDTASTLDP GDRTNPNFPQ VGHGSIKLTV LFNDPGDPE