Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3908 |
Symbol | |
ID | 6064385 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 4291202 |
End bp | 4292557 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641603322 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001726837 |
Protein GI | 170021883 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGACTG CACCCAAAAT TACATTTATC GGCGCTGGTT CGACGATTTT CGTTAAAAAT ATTCTTGGTG ATGTGTTCCA TCGCGAGGCG CTGAAAACGG CGCATATTGC CCTGATGGAC ATTGATCCCA CCCGCCTGGA AGAGTCGCAT ATTGTGGTGC GTAAGCTGAT GGATTCAGCA GGGGCCAGCG GCAAAATCAC CTGCCACACC CAACAGAAAG AAGCCTTACA GGATGCCGAT TTTGTCGTGG TGGCATTTCA GATTGGCGGT TATGAACCTT GCACGGTGAC TGATTTCGAG GTCTGTAAGC GGCATGGTCT GGAACAAACC ATTGCCGATA CGTTGGGGCC GGGCGGTATT ATGCGCGCGC TACGTACCAT TCCGCATCTG TGGCAAATTT GCGAGGACAT GACGGAAGTC TGCCCCGATG CCACCATGCT CAACTATGTT AACCCCATGG CGATGAATAC CTGGGCGATG TATGCCCGTT ATCCGCATAT CAAACAGGTC GGGCTGTGCC ATTCGGTGCA GGGAACGGCG GAAGAGCTGG CGCGTGATCT CAATATCGAC CCAGCTACGC TGCGTTACCG CTGCGCAGGT ATCAACCATA TGGCGTTTTA CCTGGAGCTG GAGCGCAAAA CCGCCGACGG CAGTTACGTG AATCTCTACC CGGAACTGCT GGCGGCTTAT GAAGCAGGGC AGGCACCGAA GCCAAATATT CATGGTAATA CTCGCTGCCA GAATATTGTG CGCTACGAAA TGTTCAAAAA GCTGGGCTAC TTCGTCACGG AATCGTCAGA ACATTTTGCC GAATATACGC CGTGGTTTAT TAAGCCAGGT CGTGAGGATT TGATTGAGCG TTATAAAGTA CCGCTGGATG AGTACCCGAA ACGCTGCGTC GAGCAGCTGG CGAACTGGCA TAAAGAGCTG GAGGAGTATA AAAACGCCTC CCGGATTGAT ATTAAACCGT CACGGGAATA TGCCAGCACA ATCATGAACG CTATCTGGAC TGGCGAGCCG AGTGTGATTT ACGGCAACGT CCGTAACGAT GGTTTGATTG ATAACCTGCC ACAAGGATGT TGCGTGGAAG TAGCCTGTCT GGTTGATGCT AATGGCATTC AGCCAACCAA AGTCGGTACG CTACCTTCGC ATCTGGCCGC CCTGATGCAA ACCAACATCA ACGTACAGAC GCTGCTGACC GAAGCTATTC TTACGGAAAA TCGCGACCGT GTTTACCACG CCGCGATGAT GGACCCGCAT ACTGCCGCCG TGCTGGGCAT TGACGAAATA TATGCTCTTG TTGACGACCT GATTGCCGCC CACGGCGACT GGCTGCCAGG CTGGTTGCAC CGTTAA
|
Protein sequence | MMTAPKITFI GAGSTIFVKN ILGDVFHREA LKTAHIALMD IDPTRLEESH IVVRKLMDSA GASGKITCHT QQKEALQDAD FVVVAFQIGG YEPCTVTDFE VCKRHGLEQT IADTLGPGGI MRALRTIPHL WQICEDMTEV CPDATMLNYV NPMAMNTWAM YARYPHIKQV GLCHSVQGTA EELARDLNID PATLRYRCAG INHMAFYLEL ERKTADGSYV NLYPELLAAY EAGQAPKPNI HGNTRCQNIV RYEMFKKLGY FVTESSEHFA EYTPWFIKPG REDLIERYKV PLDEYPKRCV EQLANWHKEL EEYKNASRID IKPSREYAST IMNAIWTGEP SVIYGNVRND GLIDNLPQGC CVEVACLVDA NGIQPTKVGT LPSHLAALMQ TNINVQTLLT EAILTENRDR VYHAAMMDPH TAAVLGIDEI YALVDDLIAA HGDWLPGWLH R
|
| |