Gene Ent638_0018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_0018 
Symbol 
ID5110506 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp20038 
End bp21360 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content54% 
IMG OID640490174 
Productglycoside hydrolase family protein 
Protein accessionYP_001174759 
Protein GI146309685 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAT TCTCAGTTGT TATTGCAGGC GGCGGCAGCA CGTTTACACC TGGTATCGTC 
CTGATGCTGT TGGCAAACCG CGATCGCTTC CCGCTTCGCG CGCTGAAGTT CTATGACAAC
GACGGTGCGC GTCAGGAGAT CATCGCCGAG GCGTGCAAAG TGATTCTTCA GGAACAAGCA
CCAGAAGTTG ATTTTAGTTA CACCACCGAC CCAAAAGCGG CGTTTACCGA CGTTGATTTT
GTGATGGCGC ATATCCGCGT CGGCAAATAT CCGATGCGTG AAAAAGATGA AAAAATCCCG
CTGCGTCATG GTGTGCTAGG TCAGGAAACC TGCGGTCCGG GCGGGATCTC CTACGGTATG
CGCTCTATTG GTGGCGTCCT TGAGCTGGTG GATTATATGG AACAATACTC GCCGAACGCG
TGGATGCTGA ACTACTCCAA CCCAGCGGCG ATCGTGGCGG AAGCGACCCG TCGACTGCGC
CCGAACGCCA AAATCCTCAA CATTTGTGAT ATGCCGATCG GCATTGAAGG GCGCATGGCG
CAGATTGTCG GCCTGAAGGA TCGCAAAGCG ATGCGCGTGC GTTACTACGG GCTTAATCAC
TTTGGCTGGT GGACATCGAT TGAAGATTTA GACGGTAACG ATCTGATGCC GAAACTGCGG
GAATATGTCG CGAAAAATGG ATATTTACCG CCGTGTAACG ATGCGAATTC CGAAGCGAGC
TGGAACGATA CCTTTGCCAA GGCGAAAGAC GTCCAGGCGT TGGACCCGGA CACGATGCCA
AACACGTACC TGAAATATTA CCTTTTCCCG GACTACGTGG TGGCACACTC CAATCCAGAA
CGCACCCGGG CAAATGAAGT CATGGATCAC CGCGAGAAGC ACGTGTTCAG CTCCTGCCGG
GCGATTATCG AAGCCGGGAA ATCCTCCGCG GGTGAGTTGG AAATCGACGA ACATGCGTCT
TACATCGTCG ATCTGGCGAC CGCTATCGCC TTCAACACGC AAGAACGCAT GCTGTTGATT
GTGCCAAACA ATGGCGCTAT CCATAACTTT GATGCGGACG CGATGGTCGA AATTCCGTGT
CTGGTGGGCA AAAATGGCCC AGAACCGTTA ACCGTGGGTG ATATTCCGCA CTTCCAGAAA
GGGTTGATGG GCCAGCAGGT GGCCGTCGAA AAACTGGTGG TTGACGCCTG GGAACAGCGC
TCTTACACCA AATTGTGGCA GGCGATTACG CTGTCGAAAA CCGTGCCGAG CGCCTCTGTG
GCGAAAGCCA TTCTTGATGA CCTGATCGAC GCGAACAAAG CGTATTGGCC AGAGCTGCAT
TAA
 
Protein sequence
MKKFSVVIAG GGSTFTPGIV LMLLANRDRF PLRALKFYDN DGARQEIIAE ACKVILQEQA 
PEVDFSYTTD PKAAFTDVDF VMAHIRVGKY PMREKDEKIP LRHGVLGQET CGPGGISYGM
RSIGGVLELV DYMEQYSPNA WMLNYSNPAA IVAEATRRLR PNAKILNICD MPIGIEGRMA
QIVGLKDRKA MRVRYYGLNH FGWWTSIEDL DGNDLMPKLR EYVAKNGYLP PCNDANSEAS
WNDTFAKAKD VQALDPDTMP NTYLKYYLFP DYVVAHSNPE RTRANEVMDH REKHVFSSCR
AIIEAGKSSA GELEIDEHAS YIVDLATAIA FNTQERMLLI VPNNGAIHNF DADAMVEIPC
LVGKNGPEPL TVGDIPHFQK GLMGQQVAVE KLVVDAWEQR SYTKLWQAIT LSKTVPSASV
AKAILDDLID ANKAYWPELH