Gene Acid345_0579 details

Gene Information

Locus tag: Acid345_0579
Symbol:
ID: 4068938
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 708066
End bp: 709859
Gene Length: 1794 bp
Protein Length: 597 aa
Translation table: 11
GC content: 60%
IMG OID: 637982584
Product: Beta-glucuronidase
Protein accession: YP_589658
Protein GI: 94967610
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
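
The coordinate and length fields in this record can be cross-checked against one another. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 708066
end_bp = 709859
gene_length_bp = 1794
protein_length_aa = 597

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final codon is a stop,
# which does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                  # True
print(remainder == 0)                             # True
print(expected_protein_aa == protein_length_aa)   # True
```

Under these assumptions the three fields agree: 1794 bp is 598 codons, of which 597 encode amino acids.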


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.588322
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGACCA CTCACCGCAC TCAACGCCGC TCCGTGCTCA TCGTTCTAGT TCTTTTCGCC 
GCCGCGGCAA CTGCATTTGC ACAAACGCCT CTCATCGCCA ACATCCACGC GCGCCAGATC
ACCGATCTCT CCGGCACCTG GAGCACCATC ATTGACGCCT ACCACGTCGG CGAGCGCAAT
CGCTTCTACG AAGATCGACA CCAAACCGAT CCTTCTGAGC TCATCGAATA CAACTTCGAT
CACTCTCCGA CTTTGAAAGT CCCCGGAGAT TGGAACTCTC AACGCCCTGA GCTGTTTCTG
TACGAGGGCA CGCTCTGGTA TCGGCGAATC TTCTCCTACC ATCCAGCGCA GGGAAAGCGT
CAGTTCGTTT ACTTTGGCGC CGCGAATTAT CACGCGACCG TCTATCTCAA CGCGCAGAAG
CTTGGCGAAC ACAGCGGCGG CTACACGCCA TTCAATTTCG AAGTCACCGG CAAACTAAAA
GACGGCGAAA ACTTGCTCGT CGTGGAAGTG GATGATCGTC GCAGCAAGGA CGCCATTCCC
GCGCTCAATA CCGATTGGTG GAACTACGGC GGCCTCACCC GCGAAGTCTC CCTCGTAGAA
GTACCGAGTG CGTTCATCGA GAATTACTTC ATCCAGTTGG CGAAAGGTTC GCCGAACGAG
ATCGCCGGCT GGGTGCAACT CAACAGCGCC GCCGGGGGAT CCGAGGTAAC GATCGAGATT
CCCGAAGCCA AGCTCCGCCA GAGCGCCAAG GCAGATGAGC ACGGCCGCGC AGCCTTCCGC
TTCCCCGCGC AACTGACGCC GTGGTCGCCC GACAATCCCA AGCTCTACGA AGTTCGCATC
TCATCCGGCA CAGACCATCT GACCGACCAG ATTGGCTTCC GCACCATCGA AGCCCGCGGC
ACTAAGCTCT ACCTCAATGG CAAGCCGATC TTCCTGCGCG GCATCTCGAT CCACGAAGAA
GCGCCCTTCC GCGGCGGACG CGCCTTCGCA TCCGAAGACG ACAAAACGCT CCTCGGCTGG
GCGAAAGAAC TCGGCTGCAA TTACGTGCGA CTCGCCCACT ACCCGCACCA CGAGAGCATG
GTGCGCGAAG CCGAACGTAT GGGCATCCTC GTCTGGTCCG AGATCCCCGT TTACTGGGAC
ATTGACTGGA AAAACCCCGA CTCCCTCGCG CAGGCCCGCC AGCAACTTCA CGAAGAGATC
GCCCGCGACC AGAATCGTGC AGCGATTATT CTCTGGTCTA TCGCCAACGA AACTCCCATT
GATCCCGACC GTCTTGAGTT CCTGAAAGCC CTGGCCTCCG ACGTTCGCTC GCTCGACAAC
ACTCGCCTGC TGACCGCCGC CCTCAATCGC ACCGGACGCG AAGGTAAAAC CCGCCTCATC
GACGACCCGC TCGGCGCCGT TGTGGACGTG CTTGCGATCA ACGAGTACAT CGGCTGGTAT
GAAAGCCGGG TCGAAGATGC CGACACTACT GAATGGAAAT CCTCGTGGGA GAAGCCGCTG
CTGTTCAGCG AATTTGGCGG CGGCGCGCCC TATGGACGTC ACGGCGCAAC TAACGAACGT
TGGACCGAGG AGTACCAGGC AAACCTCTAC CGCCATCAGC TGACCATGCT GCGCAAGATT
CCCGCCCTCG CCGGACTCTC GCCGTGGGTG CTCATGGACT TCCACTCGCC CGTGCGCCTG
CTGCCCGGCG TGCAGGACAT GCGCAACCGC AAAGGCCTCG TCTCCGACCA GGGCCAGCGC
AAGCAAGCCT TCTACGTGTT ACAGGAGTAC TACCGCGAAA TGGGTGCGCA CTAG
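
The GC content field of this record could be recomputed from the gene sequence above. This is a minimal sketch, assuming the sequence is pasted as printed, in space-separated blocks of ten bases. The function name gc_percent is hypothetical, and the reported 60% is presumably a rounded value.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces and line breaks between blocks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Usage: the first row of the gene sequence only. Pass the full sequence
# to compare against the GC content field of the record.
first_row = "ATGCCGACCA CTCACCGCAC TCAACGCCGC TCCGTGCTCA TCGTTCTAGT TCTTTTCGCC"
print(f"{gc_percent(first_row):.1f}% GC in the first 60 bases")
```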
 
Protein sequence
MPTTHRTQRR SVLIVLVLFA AAATAFAQTP LIANIHARQI TDLSGTWSTI IDAYHVGERN 
RFYEDRHQTD PSELIEYNFD HSPTLKVPGD WNSQRPELFL YEGTLWYRRI FSYHPAQGKR
QFVYFGAANY HATVYLNAQK LGEHSGGYTP FNFEVTGKLK DGENLLVVEV DDRRSKDAIP
ALNTDWWNYG GLTREVSLVE VPSAFIENYF IQLAKGSPNE IAGWVQLNSA AGGSEVTIEI
PEAKLRQSAK ADEHGRAAFR FPAQLTPWSP DNPKLYEVRI SSGTDHLTDQ IGFRTIEARG
TKLYLNGKPI FLRGISIHEE APFRGGRAFA SEDDKTLLGW AKELGCNYVR LAHYPHHESM
VREAERMGIL VWSEIPVYWD IDWKNPDSLA QARQQLHEEI ARDQNRAAII LWSIANETPI
DPDRLEFLKA LASDVRSLDN TRLLTAALNR TGREGKTRLI DDPLGAVVDV LAINEYIGWY
ESRVEDADTT EWKSSWEKPL LFSEFGGGAP YGRHGATNER WTEEYQANLY RHQLTMLRKI
PALAGLSPWV LMDFHSPVRL LPGVQDMRNR KGLVSDQGQR KQAFYVLQEY YREMGAH
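
The protein sequence should follow from the gene sequence by translation. Below is a minimal sketch, assuming the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits). The names CODON_TABLE and translate_cds are hypothetical, and alternative start codons are not handled, which is sufficient here because this gene begins with ATG.

```python
# Standard codon table in TCAG order. Assumption: translation table 11 uses the
# same amino acid assignments and differs only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(sequence: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(bases), 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# Usage: the first 30 bases of the gene sequence in this record.
print(translate_cds("ATGCCGACCA CTCACCGCAC TCAACGCCGC"))  # MPTTHRTQRR
```

The output matches the first block of the protein sequence in this record. Translating the full gene sequence the same way allows the remaining blocks to be compared.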