Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ent638_0018 |
Symbol | |
ID | 5110506 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | + |
Start bp | 20038 |
End bp | 21360 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640490174 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001174759 |
Protein GI | 146309685 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAT TCTCAGTTGT TATTGCAGGC GGCGGCAGCA CGTTTACACC TGGTATCGTC CTGATGCTGT TGGCAAACCG CGATCGCTTC CCGCTTCGCG CGCTGAAGTT CTATGACAAC GACGGTGCGC GTCAGGAGAT CATCGCCGAG GCGTGCAAAG TGATTCTTCA GGAACAAGCA CCAGAAGTTG ATTTTAGTTA CACCACCGAC CCAAAAGCGG CGTTTACCGA CGTTGATTTT GTGATGGCGC ATATCCGCGT CGGCAAATAT CCGATGCGTG AAAAAGATGA AAAAATCCCG CTGCGTCATG GTGTGCTAGG TCAGGAAACC TGCGGTCCGG GCGGGATCTC CTACGGTATG CGCTCTATTG GTGGCGTCCT TGAGCTGGTG GATTATATGG AACAATACTC GCCGAACGCG TGGATGCTGA ACTACTCCAA CCCAGCGGCG ATCGTGGCGG AAGCGACCCG TCGACTGCGC CCGAACGCCA AAATCCTCAA CATTTGTGAT ATGCCGATCG GCATTGAAGG GCGCATGGCG CAGATTGTCG GCCTGAAGGA TCGCAAAGCG ATGCGCGTGC GTTACTACGG GCTTAATCAC TTTGGCTGGT GGACATCGAT TGAAGATTTA GACGGTAACG ATCTGATGCC GAAACTGCGG GAATATGTCG CGAAAAATGG ATATTTACCG CCGTGTAACG ATGCGAATTC CGAAGCGAGC TGGAACGATA CCTTTGCCAA GGCGAAAGAC GTCCAGGCGT TGGACCCGGA CACGATGCCA AACACGTACC TGAAATATTA CCTTTTCCCG GACTACGTGG TGGCACACTC CAATCCAGAA CGCACCCGGG CAAATGAAGT CATGGATCAC CGCGAGAAGC ACGTGTTCAG CTCCTGCCGG GCGATTATCG AAGCCGGGAA ATCCTCCGCG GGTGAGTTGG AAATCGACGA ACATGCGTCT TACATCGTCG ATCTGGCGAC CGCTATCGCC TTCAACACGC AAGAACGCAT GCTGTTGATT GTGCCAAACA ATGGCGCTAT CCATAACTTT GATGCGGACG CGATGGTCGA AATTCCGTGT CTGGTGGGCA AAAATGGCCC AGAACCGTTA ACCGTGGGTG ATATTCCGCA CTTCCAGAAA GGGTTGATGG GCCAGCAGGT GGCCGTCGAA AAACTGGTGG TTGACGCCTG GGAACAGCGC TCTTACACCA AATTGTGGCA GGCGATTACG CTGTCGAAAA CCGTGCCGAG CGCCTCTGTG GCGAAAGCCA TTCTTGATGA CCTGATCGAC GCGAACAAAG CGTATTGGCC AGAGCTGCAT TAA
|
Protein sequence | MKKFSVVIAG GGSTFTPGIV LMLLANRDRF PLRALKFYDN DGARQEIIAE ACKVILQEQA PEVDFSYTTD PKAAFTDVDF VMAHIRVGKY PMREKDEKIP LRHGVLGQET CGPGGISYGM RSIGGVLELV DYMEQYSPNA WMLNYSNPAA IVAEATRRLR PNAKILNICD MPIGIEGRMA QIVGLKDRKA MRVRYYGLNH FGWWTSIEDL DGNDLMPKLR EYVAKNGYLP PCNDANSEAS WNDTFAKAKD VQALDPDTMP NTYLKYYLFP DYVVAHSNPE RTRANEVMDH REKHVFSSCR AIIEAGKSSA GELEIDEHAS YIVDLATAIA FNTQERMLLI VPNNGAIHNF DADAMVEIPC LVGKNGPEPL TVGDIPHFQK GLMGQQVAVE KLVVDAWEQR SYTKLWQAIT LSKTVPSASV AKAILDDLID ANKAYWPELH
|
| |