Gene Acid345_3249 details


Gene Information

Locus tag: Acid345_3249
Symbol:
ID: 4072584
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3848975
End bp: 3850312
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 59%
IMG OID: 637985270
Product: glycoside hydrolase, clan GH-D
Protein accession: YP_592324
Protein GI: 94970276
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3345] Alpha-galactosidase
TIGRFAM ID:
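
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon (the function names are hypothetical, chosen for illustration only):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length(3848975, 3850312)
print(length, protein_length(length))  # 1338 445
```

Both values agree with the Gene Length and Protein Length fields of this record.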


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.062257
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGAG CGATATTTGT CGCACTCATT TTCTCCGCTG TTTCGTGCGC GCAGACAGAG 
CTTGCCCCGA CACCGCCTAT GGGGTGGAAC AGCTGGAACT CCTACGGGCA GACGATCTCG
GAGGCCGACG TGCGCGCCAC TGCCGATGTG ATGGCGCAGA AGCTGAAAGC CTTCGGATGG
CAGTACGTGG TGATTGATGA AGGCTGGTAT GTGCTGAAGC CGGGCGGCGA CGCGAAGCTG
TACCAGTTCC GGATGGATGA GAACGGGCGC TTTCTTCCGG CGGAGAGCCG CTTCCCTTCC
GCGAAGGACG ACGGCATGAA GGCGATTGCG GATTACGTAC ATTCGCTCGG GCTGAAGTTT
GGGATCCATG TGATTCGCGG GATCCCGCGC GAGGCGGTGG CGAAGAATTT GCCGATCGCG
GGGAGTTCGT TTCATGCGGC GGAGGCGGCG GATACGAGCG ATGTGTGTCC GTGGAATGCG
TACAACTACG GGCTGAAGGA CAACGCCGCG GGGCAGGCCT ATTACGACAG CATTGCGAAG
CTGTATGCGA GCTGGGGCGT GGACTTCATC AAGATTGATT GCATTTCGTC GCACCCATAC
AAGGGCGCGG AGATACGGAT GATCAGCGAG GCGCTGCATA AGAGCGGGCG TGCGATGGTG
CTGAGCCTGT CGCCGGGGCC GACGCCGCTG GAGAAGGCCG ATGAGGTCGC GAAGTACGCG
GAGATGTGGC GCATCTCGGA TGACTTCTGG GACCACTGGG GCGTGTGGAA GGGACATGAG
TGGTCTCAAG GATTGAAGGC GCAATTCGAG AATGCGGCGA AGTGGAGCGC GCATCAGACG
CCGGGACATC GGCCCGACAA TGACATGCTG CCGTTGGGAT ATCTGGGGCC GCATCCGGGT
GAAGGCGAGG TCAGGAATAC GCAGTTCACG GCGGATGAGC AGAAGACGCT GATGACGCTG
TGGGCGATTT CGCGGTCGCC GCTGATGATG GGCGGGAACT TGACGAAGAT GGATGAGGCG
ACGCTGGGAG TGCTGACGAA TAAGTCCGTC ATAGCGCTTG ACCAGCAGTC TCACAACGGC
AAGCAAGTGA TCAACGATGG GAATACTGTG GTTTGGTTCG CGGAGGGCTT GTCTCCGATC
GCTGGATACG TTGCTGTGTT TAACATTTCT GATCGCGAGC AGTCCGTGAA GTATTCGTGG
AAGGAACTTG GTTTAGCGGA GGGAGCACGC GTTGCGCGAG ATGTTTGGGC GGAACGTTCT
TTGGGGAAAC AGGAGGGCGT TAACCTGAAG CTTGCGCCCC ACGCGTGTGT GCTGTACCAG
CTGTCATTGT TGAAATAG
 
Protein sequence
MKRAIFVALI FSAVSCAQTE LAPTPPMGWN SWNSYGQTIS EADVRATADV MAQKLKAFGW 
QYVVIDEGWY VLKPGGDAKL YQFRMDENGR FLPAESRFPS AKDDGMKAIA DYVHSLGLKF
GIHVIRGIPR EAVAKNLPIA GSSFHAAEAA DTSDVCPWNA YNYGLKDNAA GQAYYDSIAK
LYASWGVDFI KIDCISSHPY KGAEIRMISE ALHKSGRAMV LSLSPGPTPL EKADEVAKYA
EMWRISDDFW DHWGVWKGHE WSQGLKAQFE NAAKWSAHQT PGHRPDNDML PLGYLGPHPG
EGEVRNTQFT ADEQKTLMTL WAISRSPLMM GGNLTKMDEA TLGVLTNKSV IALDQQSHNG
KQVINDGNTV VWFAEGLSPI AGYVAVFNIS DREQSVKYSW KELGLAEGAR VARDVWAERS
LGKQEGVNLK LAPHACVLYQ LSLLK
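
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration rather than the method used by the source database: it assumes the gene sequence is given 5' to 3' on the coding strand, and it uses the codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the set of permitted start codons). The helper names `translate` and `gc_content` are hypothetical.

```python
from itertools import product

# Codon assignments of the standard code, in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last codons of the gene sequence shown above.
print(translate("ATGAAACGAG CGATATTTGT CGCACTCATT"))  # MKRAIFVALI
print(translate("CTGTCATTGT TGAAATAG"))               # LSLLK
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows a comparison with the protein sequence and the GC content field of this record.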