Gene Acid345_0901 details

Gene Information

Locus tag: Acid345_0901
Symbol:
ID: 4069112
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1127585
End bp: 1129642
Gene Length: 2058 bp
Protein Length: 685 aa
Translation table: 11
GC content: 55%
IMG OID: 637982908
Product: Beta-galactosidase
Protein accession: YP_589978
Protein GI: 94967930
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1874] Beta-galactosidase
TIGRFAM ID:
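
The listed gene and protein lengths follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the function names are illustrative and not part of the database):

```python
# Values copied from the Gene Information table.
START_BP = 1127585
END_BP = 1129642


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 2058 685
```

Both results agree with the Gene Length and Protein Length entries in the table.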


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.240206
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGCAT TAAAACGGCT ATTGGTCTTC AGCGCAGTTG CTGGATTGCT CACCGTGTTC 
GCTCCAGCTC AGAAGACGAA CGCGCCATCG AATTCAGGAT TGATGCTGGG AGTTGCATGG
TATCCAGAGC AATGGCCCGA AGACCGCTGG GAGAAAGATC TCGTCCTCAT GCAAGCGGCG
GGAATCCACG TAGTGCGTAT CGGAGAGTTC GCGTGGAGCA CGATGGAACC GAGCGAGGGA
AAATACGAAC TCGATTGGCT CCAGCGCGCG ACCCGACTCG CTGCGAAGCA TCACATCGAC
GTCATTCTTG GCACTCCTAC TGCGGCCCCT CCAGTCTGGT TAACTCAGAA ATACCCCGAT
ACTTTGCTCA TCGACGAACA GGGAAAACGC GCGGTGCATG GCAATCGCGC CCACTTTTCA
TTCACGAGCC CACGCTACCG CGAGTTCTGC CGCAAGATTG CGGAAGAGAT GGCGCGGAAA
CTGGCGAAAG AGCCGAACGT TATCGGCTGG CAACTGGACA ATGAGGTTTC CAATCCGTCG
TATGACGGGT ACACACGCGC GCAGTTCCAG GAATGGCTGA AGGACAAATA TAAGGGTCTC
GACGCTCTCA ATCAGCACTG GACGACAGCC TACTGGAGTC AAACGTACTT CGATTGGTCG
CAGATTCCGA TCCCAGTGGG CGGCAATAAC CCCGGCCTCA TGCTCGATTG GAAACGGTTC
ATCACCGACA CTTGGTATAG CTATCTTTCG AATCAGATCG CGGCAATTCG CAAGTACGCG
CCGGCGTCCC AACCGATTTG TACAAACACA ATGGGACTCT TCGATGGTTT CGACCACTAC
AAAGTTGAAA GTGAACTCGA CATCGTCGCG TGGGACCATT ACGTCGGCCA GGGCCACCTT
AATCCGGACT TTGCCGGATT CATCCACGAC CTGAATCGCG GATTGAAGCA GAAGAACTTC
TGGGTGATCG AGACACAACC GGGCGCGGTG AATTGGCAAT CGATCAATAA CGTGTTGGAC
AAAGGTGAAG TGCGAGCGAT GGCTTGGCAC GACGTTGCGC ACGGCGCGGA CATGGTGAGT
TATTGGCAAT GGAGAAGCGC ACTGAATGGG CAGGAGGAAT ATCACGGAGT GCTGGTGGGC
GCGGACGGCA CCCCTGTTCC GGTTTACGAC GAAGTGAAAC AGGTTGGTAA CGACTTCGCG
AAGGCGTATC CAGTTCTCGC GGGAACGTCG CCGCATTCCG AAGTTGCGAT GCTGCATGAC
TACGACAGCC GTTGGGCGAT CGATTGGCAG AAGCACAATC GCAACTACGA TCAGATCAAA
ATCCATGTCT CGTACTACCA CGCCCTCCGA AAACTGGTCC ACAGCATAGA TGTCGTGAAT
CCTTCAGTTG CACTCAAAAA CTACAAAGTC GTCGTTGCTC CAAATCTGAG TCTCATTCCC
GACTCGCTGG CAAAGCATTT GCGTGAATAC GTGGAACAAG GGGGGCATCT CGTCCTCGGG
CCGAGAGCCG GCATGAAAGA CGAGTTCAAC GCGTTGCTTA CGGAACGACA GCCTGGAGCA
CTGGTCGATA CTCTCGGCGG GCGAGTGGAG CAATTCTATG CTTTGGATCA GGACGTTCCT
CTCGAAGGCC CTCTTGGGTC GGGGCACTCT TCCCTCTGGG CCGAACAACT TAGCGCAAAG
CCGGGAACGG ATGCCTTGCT CACCTTTGGA AAAAGCAACG GTTGGCTCGA CCGTCAGCCT
GCAATCATCT CGCGCAAAGT CGGGAAGGGA AGAATCACAT ACGTGGGGGC CGTGCTCGAT
GAATCCCTGA TGGACAAATT TGCCGGATGG ATATTCTCGA CCAGTGGATT GCATTCTGCA
TTTGGTCTCA TTCCGGATGG TGTTGATGTT TCGGAGCGAA GTGGCAATGG GAAAGACGTG
TTCGTGCTCA TTAACTTCAA GCAAGAGAAC CAAAGCGTGC AACTCCCGAA GAGCATGAAG
CGGGTACTCG CGAATGGGGA ATCGGTGAGT AGCGTAAATC TGCCGCCGTA CGGAGTGGAG
GTCCTGGAGG CGCAATGA
 
Protein sequence
MNALKRLLVF SAVAGLLTVF APAQKTNAPS NSGLMLGVAW YPEQWPEDRW EKDLVLMQAA 
GIHVVRIGEF AWSTMEPSEG KYELDWLQRA TRLAAKHHID VILGTPTAAP PVWLTQKYPD
TLLIDEQGKR AVHGNRAHFS FTSPRYREFC RKIAEEMARK LAKEPNVIGW QLDNEVSNPS
YDGYTRAQFQ EWLKDKYKGL DALNQHWTTA YWSQTYFDWS QIPIPVGGNN PGLMLDWKRF
ITDTWYSYLS NQIAAIRKYA PASQPICTNT MGLFDGFDHY KVESELDIVA WDHYVGQGHL
NPDFAGFIHD LNRGLKQKNF WVIETQPGAV NWQSINNVLD KGEVRAMAWH DVAHGADMVS
YWQWRSALNG QEEYHGVLVG ADGTPVPVYD EVKQVGNDFA KAYPVLAGTS PHSEVAMLHD
YDSRWAIDWQ KHNRNYDQIK IHVSYYHALR KLVHSIDVVN PSVALKNYKV VVAPNLSLIP
DSLAKHLREY VEQGGHLVLG PRAGMKDEFN ALLTERQPGA LVDTLGGRVE QFYALDQDVP
LEGPLGSGHS SLWAEQLSAK PGTDALLTFG KSNGWLDRQP AIISRKVGKG RITYVGAVLD
ESLMDKFAGW IFSTSGLHSA FGLIPDGVDV SERSGNGKDV FVLINFKQEN QSVQLPKSMK
RVLANGESVS SVNLPPYGVE VLEAQ
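
The sequences above are printed in blocks of 10 characters, so they need to be joined before any analysis. A minimal sketch, assuming plain Python with no external libraries, of how the gene sequence could be cleaned, its GC fraction computed, and its translation checked against the protein sequence. The helper names are hypothetical. The codon table uses the standard amino-acid assignments, which translation table 11 shares for every codon that is not an initiation codon.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments; it differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-character blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, copied from the record.
first_line = "ATGAATGCAT TAAAACGGCT ATTGGTCTTC AGCGCAGTTG CTGGATTGCT CACCGTGTTC"
print(translate(clean_sequence(first_line)))  # MNALKRLLVFSAVAGLLTVF
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` in the same way would allow the GC content and Protein Length entries to be checked.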