Gene Acid345_1677 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1677 
Symbol 
ID4069345 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2028162 
End bp2029388 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content57% 
IMG OID637983685 
ProductAlpha-galactosidase 
Protein accessionYP_590752 
Protein GI94968704 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3345] Alpha-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.343798 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGGAGGC CCGGCCCGCA GATGAAACGC TTCGTCGCAT CCATCCCTCT CGTAATAATT 
TTTCTTAGCA TCATTGCTGC CGCGCAAGAA TCCCCAAAGC CGCCGGACAT CAACGCCTAC
GCGCCAACCC CGCCGATGGG ATGGAACACG TGGAACAAGT TCGCCTGCAA TGTTGACGAG
AAACTCATTC GTGGCGCCGC CGATGCACTC GTCTCCTCTG GCATGAAAGA CGCTGGTTAC
CATTACCTCG TCATTGACGA CTGCTGGCAA GTCAGCCGCG ACGGGAAAGG CAACATCGTT
GCCGATCCAC AACGATTCCC TTCCGGCATC AAGGCCCTCG CCGCGTACGT CCACCACAAA
GGACTGAAAT TCGGTATCTA CTCCGACATC GGCACCAAGA CTTGCCAAGG CCGGCCCGGA
AGCCGGGGCC ATGAATTTCA AGACGCCCTG CAATACGCCG CATGGGATGT CGACTATCTC
AAACTCGATT GGTGCAACAC TGGCACCCAG AACGCCCCAG CCTCGTACAA AACCATGCGC
GATGCGATTG ATGCCACGGG CCGTCCCATC GTTCTAAGCA TCTGCGAATG GGGCAAGAGC
AAGCCGTGGC TCTGGGCCAA AGACACCGGT AATCTCTGGC GCACCACCGA CGACATTCAG
GATCGCTGGG CCGGCAAGAA AAAATGGGAT GAAGCCTCAT GCTGCTCCAA CGGCATGCTC
GACATTCTCG ATGAAGAAGC CGACATCGCC AGTTACGCCG GTCCCGGACA CTGGAACGAT
CCCGACATGC TCGAGGTCGG CAACGGCGGC ATGACCAACG TCGAATATCG TTCCCACTTC
AGCCTATGGG CAATCCTCGC TGCGCCTCTC ATGGCCGGCA ACGATCTCAC CAACATGCGC
CCCGAAATTC GCGAAATCTT GACCAACAAA GAAGTGATCG CCGTAGATCA AGACCCGCTC
GGCAAGCAGG GCGTTCGCGC AGCGAAGAAC GGCGATCTCG AAATCTGGAG CAAAGTGCTT
GCCGACGGAT CTCGCGCCGC TGTCCTCCTC AACCGCAGTG CATCGGAACA AGAGATCACC
GCGCGATGGA TCGACCTCGG TTATCCCGCG ACACTCTCCG TGAGGGTCCG AGACCTCTGG
GCGCACAAAG ATCTCGGTTC TTTCAAAGAC AAGTTTTCCG CCAAAGTTCC ATCGCACGGT
GTCGTTATGA TTCACCTGCA ACCGTAA
 
Protein sequence
MRRPGPQMKR FVASIPLVII FLSIIAAAQE SPKPPDINAY APTPPMGWNT WNKFACNVDE 
KLIRGAADAL VSSGMKDAGY HYLVIDDCWQ VSRDGKGNIV ADPQRFPSGI KALAAYVHHK
GLKFGIYSDI GTKTCQGRPG SRGHEFQDAL QYAAWDVDYL KLDWCNTGTQ NAPASYKTMR
DAIDATGRPI VLSICEWGKS KPWLWAKDTG NLWRTTDDIQ DRWAGKKKWD EASCCSNGML
DILDEEADIA SYAGPGHWND PDMLEVGNGG MTNVEYRSHF SLWAILAAPL MAGNDLTNMR
PEIREILTNK EVIAVDQDPL GKQGVRAAKN GDLEIWSKVL ADGSRAAVLL NRSASEQEIT
ARWIDLGYPA TLSVRVRDLW AHKDLGSFKD KFSAKVPSHG VVMIHLQP