Gene Ent638_2022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_2022 
Symbol 
ID5113438 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp2195080 
End bp2196489 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content55% 
IMG OID640492210 
Productbeta-fructofuranosidase 
Protein accessionYP_001176749 
Protein GI146311675 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1621] Beta-fructosidases (levanase/invertase) 
TIGRFAM ID[TIGR01322] sucrose-6-phosphate hydrolase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0870629 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATTC CTTCTCGCTG GCCTGCGGTA CTGCAGGCCG TCATGAAAGG CCAACCGCAG 
ACGTTAACTG ACGATCATTA TCCTCAGTGG CATCTTGCGC CGGTGATCGG ATTGATGAAC
GATCCCAACG GTTTTATCTG GTTTGCCGGA CGTTACCATC TGTTCTATCA GTGGAACCCG
CTGGGCTGCG ATCATCGTTA TAAATGCTGG GGCCACTGGA GTTCGGCTGA TTTAGTGCAG
TGGCAGCATG AACCGATGGC GCTGATGCCG GATGAAGAAT ACGACCGTAA CGGCTGCTAC
TCCGGTAGCG CCGTGGATAA TCAGGGCGTT TTAACGCTGT GCTATACCGG CAACGTCAAA
TTTGATGACG GCAGTCGCAC CGCATGGCAA TGCCTGGCAG TACAAAATGA AGCCGGTGGT
TTCGATAAGC TCGGTCCAGT ATTGCCACTA CCGGAAGGTT ATACCGGCCA TGTGCGCGAC
CCAAAAGTGT GGCAGCACGA TGGGCAGTGG TACATGGTGC TGGGCGCACA GGATTTACAA
AAGCGCGGCA AAGTGCTGCT GTATCAGTCT GCGGATTTGC ACAGCTGGCA GCCCCTCGGC
GAAATCGCGG GCCACGGCGT GAACGATCTG GCCGATGCAG GGTATATGTG GGAGTGTCCG
GACCTGTTCG AGCTGGACGG TACGCACGTG TTGATCTGCT GCCCGCAGGG ACTGGCGCGC
GAACCGCATC GCTATCGCAA TACCTATCCC TCTACCTGGA TGAGCGGCGA TTTTCACTAT
GACGATGCGA AATTTGAACA TGGCGCACTG CACGAACTGG ATGCCGGGTT TGAATTTTAT
GCCCCGCAAA CGACATTAGC GGCAGATGGC CGTCGGATCC TGATTGGCTG GATGGGCGTC
CCGGACGGGG AAGAAGTACT ACAACCAACC TGCGAACACG GCTGGATCCA TCAGATGACG
TGTCCGCGCG AGCTGCATTT CCGCGACGGA AAACTTTTGC AAACACCGAT TCGGGAACTG
CAACAGCTGC GCGAGGAAGA GCAAAACTGG CACGGAAATG CAGCGTCAGC CCCCGCGCTG
GACGCCGGGC GTCTTGAGTT CGAGCTCAAA ACAGACGCGG CGGTGAAGGT GAACTTCGCC
GATACGCTGT GGCTCAGCCT GGACGAAAAG GGGATTAGGC TTGAACGAAA AAGCCTGCGC
AACGATGAAA TATTGACGCG TTACTGGAAC GGAAAGGTCA CTTCTCTACG CGTTTTGTGC
GACAGATCCA GTGTCGAAAT TTTCATTAAT GAAGGCGAGG GCGTGATGAG CAGCCGCTAT
TTCCCGGGCC ATCCGGCGCA AATACGCTTC GAAGGTGCGT CCGTCATCAC ATTACGCTAC
TGGTTGCTTC GCGCTAGCAT GATAGAATGA
 
Protein sequence
MTIPSRWPAV LQAVMKGQPQ TLTDDHYPQW HLAPVIGLMN DPNGFIWFAG RYHLFYQWNP 
LGCDHRYKCW GHWSSADLVQ WQHEPMALMP DEEYDRNGCY SGSAVDNQGV LTLCYTGNVK
FDDGSRTAWQ CLAVQNEAGG FDKLGPVLPL PEGYTGHVRD PKVWQHDGQW YMVLGAQDLQ
KRGKVLLYQS ADLHSWQPLG EIAGHGVNDL ADAGYMWECP DLFELDGTHV LICCPQGLAR
EPHRYRNTYP STWMSGDFHY DDAKFEHGAL HELDAGFEFY APQTTLAADG RRILIGWMGV
PDGEEVLQPT CEHGWIHQMT CPRELHFRDG KLLQTPIREL QQLREEEQNW HGNAASAPAL
DAGRLEFELK TDAAVKVNFA DTLWLSLDEK GIRLERKSLR NDEILTRYWN GKVTSLRVLC
DRSSVEIFIN EGEGVMSSRY FPGHPAQIRF EGASVITLRY WLLRASMIE