Gene Acid345_1689 details


Gene Information

Locus tag: Acid345_1689
Symbol:
ID: 4069357
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2048956
End bp: 2050053
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 56%
IMG OID: 637983697
Product: glycoside hydrolase family protein
Protein accession: YP_590764
Protein GI: 94968716
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2730] Endoglucanase
TIGRFAM ID:
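
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record; assumed to be 1-based and inclusive.
start_bp = 2048956
end_bp = 2050053

gene_length = end_bp - start_bp + 1   # both endpoints are counted
codons = gene_length // 3             # total codons, including the stop codon
protein_length = codons - 1           # the stop codon is not translated

print(gene_length, "bp")              # 1098 bp
print(protein_length, "aa")           # 365 aa
```

Both values agree with the Gene Length and Protein Length fields of the record.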


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0715346
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTCGCC GTCTTGCAGT TGTCTTTGCA TTGGTATCCA TTGTTGCGTT TGCGCAGGAC 
TCACCGGCAT TCCGCCGCGC GCAACATCTG CGCCACGGGA TCAACGTCAG TGAATGGTTT
GCCCAATCGA ACGACTACAG CCCGCAGCGG CTGCGCACGT TCACAAAACT CGAAGACATT
GATCGCATTA AGAAGATGGG CTTCGATCAC ATCCGCATCA GCATTGATCC GCAGGTCTTC
GACTGTTTCC GCTGGAACTC GAACTGTGAG CGCGTGCAGG TGCTGGATGA CGTGGTGAAC
CGGGCGGCCA ATCAGGACCT CGCGGTGATG ATTGATCTTC ATCCGAACAG CGAGTACAAG
AAACAAGTCG CAACGAACGG TGACGAAGCA CAACGTGCTG CACTGCTTTG GGAGCGCATC
GCCGCCCACT ATGCAAAGTC GAATCCGGAT CTACTCTTCT TCGAGTTGAT GAACGAGCCG
GAACTCGGCG ATGCGTACAA ATGGAACGGA ATCCAGGAGG CCCTCGCGGC ATCCGTGCGG
CGGGCGGCGC CGCAGCACAC GATCATCGTT GCGGGAGCTC GCTACTCCGA TATTGAAGAC
CTGGTCAGGA TGCCCGACTT TGCCGATCAC AATCTGATCT TCAACTTCCA CTACTACGAG
CCCCACACCT TCACGCATCA AGGCGCTAGC TGGGGATCTG CCTATTGGCT ACGCACGCAG
GATTTGCCGT TCCCGGCAAC ACCAGAGAAC ATGGCACCAA TTCTCGCGCA GCAACCTGAT
GACTTTACGC GCTGGCAACT GGAGGAATTT GCGCTCTCCA ACTGGAATGC CGCCCGCATT
GAAGGAGAGA TGGCCTTTGC CGCCGATTGG GCTAAGCGCC ACAACGTTCC GCTGATTTGC
AATGAGTTCG GAGCTTATCG CGAACACACT CGTCCCGAGG ATCGTATGCG CTGGATTTCT
GCGGTACGCG GCGCGTTGGA GAAAAATCAT ATCGGCTGGA CGATGTGGGA TTACCAAGGC
GGGTTCGGTG TGGTTCACGT GAAGGATGGT CAAGTCACCG AAGACGCTGC GGTACTGCAA
GCGTTGGGGC TGAAGTAG
 
Protein sequence
MLRRLAVVFA LVSIVAFAQD SPAFRRAQHL RHGINVSEWF AQSNDYSPQR LRTFTKLEDI 
DRIKKMGFDH IRISIDPQVF DCFRWNSNCE RVQVLDDVVN RAANQDLAVM IDLHPNSEYK
KQVATNGDEA QRAALLWERI AAHYAKSNPD LLFFELMNEP ELGDAYKWNG IQEALAASVR
RAAPQHTIIV AGARYSDIED LVRMPDFADH NLIFNFHYYE PHTFTHQGAS WGSAYWLRTQ
DLPFPATPEN MAPILAQQPD DFTRWQLEEF ALSNWNAARI EGEMAFAADW AKRHNVPLIC
NEFGAYREHT RPEDRMRWIS AVRGALEKNH IGWTMWDYQG GFGVVHVKDG QVTEDAAVLQ
ALGLK
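
The protein sequence corresponds to the gene sequence translated with translation table 11. As a minimal sketch, the Python below builds a codon lookup from the standard amino-acid assignments (which table 11 shares with the standard code for elongation codons) and translates the first and last blocks of the gene sequence. The helper `translate` and the constant names are hypothetical, introduced only for illustration. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()   # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":                    # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_block = "ATGCTTCGCC GTCTTGCAGT TGTCTTTGCA TTGGTATCCA TTGTTGCGTT TGCGCAGGAC"
last_block = "GCGTTGGGGC TGAAGTAG"

print(translate(first_block))  # MLRRLAVVFALVSIVAFAQD
print(translate(last_block))   # ALGLK
```

The two outputs match the first 20 and the last 5 residues of the protein sequence. The last block is in frame because the 18 preceding lines hold 60 bases each, a multiple of three.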