Gene Acid345_3886 details


Gene Information

Locus tag: Acid345_3886
Symbol:
ID: 4072221
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4595272
End bp: 4596795
Gene Length: 1524 bp
Protein Length: 507 aa
Translation table: 11
GC content: 57%
IMG OID: 637985910
Product: levanase
Protein accession: YP_592960
Protein GI: 94970912
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1621] Beta-fructosidases (levanase/invertase)
TIGRFAM ID:
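
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4595272
END_BP = 4596795


def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp):
    """Residue count for a complete CDS: one residue per codon, minus the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(gene_length(START_BP, END_BP))                   # 1524
print(protein_length(gene_length(START_BP, END_BP)))   # 507
```

Both results agree with the Gene Length (1524 bp) and Protein Length (507 aa) fields, which is consistent with the inclusive-coordinate assumption.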


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTGGCTT CCGTATGGCG ATCTGGCTTC CGCAATGCAA TATTGATAGA TTCTTCTCCC 
ATGCGCCACG TTCTCGCTCT GCTTGCGTTG ATTGCCGTTG CACTTCCTGC TCTTCCCCAA
TATGACCAGC CGTATCGTCC GCAGGTGCAT TTCTCGCCGC GCGAGCATTG GACGAACGAT
CCTAACGGGC TGGTGTTCTT TGACGGCGAA TATCACCTCT TCTTCCAGTA CAACCCGTTC
GGCGATGTGT GGGGACACAT GAGCTGGGGA CACGCCGTGA GCAAAGATTT GCTGCATTGG
GAGGAACTGC CCGTCGCGGT TCCGGAGAAG GACGGCGTGA TGATCTTCAC CGGCAGCGTC
GTCGTGGACC ACGAGAACTC GAGCGGGTTC TGCAAGCCGA AGACGGAATG CCTGGTTGCG
ATTTACACCG GATATCAAGA ACACTTCCCC GGCGGGACGC GGCAGGCCCA GTACGTGGCG
TACAGCGTGG ACCGCGGGCG GACGTGGACG AACTACGACA AGAACCCCGT GATTGATTTG
AAGATGGCGG ATTTCCGCGA CCCCAGCGTG TTTTGGGACG AGGAGCGACA CCGGTGGGTG
ATGGCGGTGT CGCTGCCGAA AGAACACGAT GTGCAGTTCT ACAGTTCAAC GAACTTGAAA
CAGTGGGCGT TGCTGAGCGA GTTCGGACAG CTCGGAGATA CGGACGGTGA TTGGGAGTGT
CCCGATCTTC TGCGAGTACC TTCTGCGCAG GATCCGACGA AGAGCACGTG GGCGTTGAAG
GTCGGGCTGA ATCCTGGGGC ACCGCAGGGC GGATCAGGGG AGCAATACTT CTTCGGTGCT
TTCGACGGCA AGACATTTAC CGCATCGCAC GAGAAGGGCG CGCATGGCTG GACGAACTAC
GGCAAAGACG ATTACTGCGC CATTAACTTC AACAACATCG CGAAAGATGA GAAGCCCGTT
CTGCTGGGCT GGATGAGTAA TTGGGAATAC GCTGCAAAGT TGCCGACATC TCCGTGGCGT
GGACAAATGA GTTTGCCGCG TAGGCTCTCG TTCGTGAAGG ACGTGGAAGG ATTGGGGCTG
AAGCAGGAGC CGGTGGTGGA AACACTGCGC GATGGGGCGG CGACGACTCT CACGTCGGCT
CCACGAGAAG CGCCGTTCGA GTTGCAGGTG ACGTTCGATC CGAAGGCGGA GCAAATCTTT
GGAATGCGGA TTTACTCTGA CAAAGAACAC TACGTTGAGA TTGGCTTCGA CCGAAAGAAC
CAGCAGCTCT TTATGGACCG CACGAAGTCG AGCGTGACGG TGGCGCAGGA GTTTCCGGGC
ACGACAGTTG CACCGCTCAC AGAAGGACGT GGGTTCGATC TGCACGTGAT TGTGGATCGA
TCGTCGGTTG AGGCGTTTGC GCAAGATGGC ACCATCGCGA TGAACAACCT GGTGTTTCCC
ACAAAGCCGC AAGTGCGCGT CGAGACATTT GGCAGCAAGC CGACATCGGC ACAAGTCTGG
AAGCTGAAGT CCATTTGGAA GTAA
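
The GC content field in the Gene Information section can be recomputed from a sequence laid out like the one above. This is a minimal sketch; `gc_percent` is an illustrative helper, and the demonstration runs on only the first two 10-base blocks of the gene sequence rather than the full 1524 bp.

```python
def gc_percent(seq):
    """Percentage of G and C bases, ignoring the spaces and line breaks used for layout."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First two 10-base blocks of the gene sequence above.
first_blocks = "GTGTTGGCTT CCGTATGGCG"
print(round(gc_percent(first_blocks)))  # 60
```

Passing the complete gene sequence to the same function gives the figure to compare with the reported value.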
 
Protein sequence
MLASVWRSGF RNAILIDSSP MRHVLALLAL IAVALPALPQ YDQPYRPQVH FSPREHWTND 
PNGLVFFDGE YHLFFQYNPF GDVWGHMSWG HAVSKDLLHW EELPVAVPEK DGVMIFTGSV
VVDHENSSGF CKPKTECLVA IYTGYQEHFP GGTRQAQYVA YSVDRGRTWT NYDKNPVIDL
KMADFRDPSV FWDEERHRWV MAVSLPKEHD VQFYSSTNLK QWALLSEFGQ LGDTDGDWEC
PDLLRVPSAQ DPTKSTWALK VGLNPGAPQG GSGEQYFFGA FDGKTFTASH EKGAHGWTNY
GKDDYCAINF NNIAKDEKPV LLGWMSNWEY AAKLPTSPWR GQMSLPRRLS FVKDVEGLGL
KQEPVVETLR DGAATTLTSA PREAPFELQV TFDPKAEQIF GMRIYSDKEH YVEIGFDRKN
QQLFMDRTKS SVTVAQEFPG TTVAPLTEGR GFDLHVIVDR SSVEAFAQDG TIAMNNLVFP
TKPQVRVETF GSKPTSAQVW KLKSIWK
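
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch under stated assumptions: translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code, and an alternative start codon such as GTG is read as methionine at the first position of a CDS. `ALT_STARTS` lists only two alternative start codons and is not the complete set for table 11; all names here are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order (table 11 uses the same assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a partial set of alternative start codons, enough for this gene.
ALT_STARTS = {"GTG", "TTG"}


def translate(seq, is_cds_start=True):
    """Translate DNA to protein, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and is_cds_start and codon in ALT_STARTS:
            aa = "M"  # alternative start codons are read as Met at position 1
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence (starts with GTG).
print(translate("GTGTTGGCTT CCGTATGGCG ATCTGGCTTC CGCAATGCAA TATTGATAGA TTCTTCTCCC"))
# MLASVWRSGFRNAILIDSSP

# Last line of the gene sequence (internal codons, ends with the TAA stop).
print(translate("AAGCTGAAGT CCATTTGGAA GTAA", is_cds_start=False))
# KLKSIWK
```

The two outputs match the first 20 and the last 7 residues of the protein sequence above.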