Gene Ent638_3016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_3016 
Symbol 
ID5111725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp3284919 
End bp3286286 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content55% 
IMG OID640493210 
Productglycoside hydrolase family protein 
Protein accessionYP_001177731 
Protein GI146312657 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.547717 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTAAAC CCCCTTTTAT TCTGTCCATC GCCGGTGGCG GCAGTACCTA TACGCCTGGC 
ATTGTGAAAA GCCTGATGGT GCAGTTACAG GACTTTCCGC TGGCAGAAAT TCGCCTCTAT
GATATCGATG CGGCGCGCCA GAACACCATT GCGCCAGTCG TTGAAAAAGT CATACGCGAT
CACAGCCAGA GCATTATTTT CACCGTCACC GACGATCCAG AAGTGGCCTT CAGCGGCGCG
CACTTTGTTT TTGCTCAGAT GCGCGTGGGT CAGTACAAAA TGCGCGAGCA GGATGAGAAG
ATCCCACTGC GTCACGGCGT AGTCGGCCAG GAAACCTGTG GGCCGGGTGG GCTTGCCTAC
GGACTGCGCA CAATCCTGCC GATGGTGGAA CTGATCGATC TTGTCGAGCG TTTCGCCCAT
GAGAAGGCCT GGATTGTGAA CTACTCCAAC CCGGCGGCGA TTGTGGCAGA AGGTGTGCGC
CGTCTGCGTC CGAACGCACG CGTGCTCAAC ATTTGCGATA TGCCGGTGGC GGCGATGCGC
AATATGGGGG CGATTTTGGG CGTCGATCGC CACAAACTGG AAGTCGATTA CTTTGGCCTG
AATCACTTCG GCTGGTTTAC GCGCGTGATG GTGGACGGCG TCGACAGACT GCCGGAGTTG
CGTAGCCATA TCGCCAAATT TGGGTTGCTG ACCGAAGACG CGGCCAAAAC CGATCCGCAG
CACTCCGATC CGTCATGGGT CAAAACCTGG CGCAACATTA AGCCGATCAT GGATAATTTC
CCGGACTATC TGCCGAATCC GTATCTGCAG TATTACCTGA TGCCTAACCA GATCGTTGAA
CATCAGAACC CGGATTACAC CCGCGCCAAC GAAGTGATGA ACGGGCGCGA GAAAAAGCTG
TTCGCGGCTG CTGAAGAGTA CAAGCGTACT GGCATTTTAT CCGATGCGTT CCACGTCGGC
GTTCACGGCG AGTTTATTGT GAATGTCGCT CGTTCGCTGG CGTTTAACCT GCGCCAGCGC
CATCTGGTGA TGGTCGAAAA CCGTGGTGCG ATCACCAATC TGCCTTACGA TGCGGTTGTT
GAAGTCCCGG CGTATATCAC ATCCGAAGGG CCAGAACCGA TTCGCGTCGG GCAGGTGCCG
CTGTTCCATC AGACTTTGCT GCAGCAGCAG CTTGCGTCTG AGCAACTGTT GGTCGAAGCC
ACTGTTGAAG GCAGCTACGA AAAAGCCCTG CAGGCCTTCA CCCTGAACCG CACGGTGCCA
ACAATGGAAC ACGCGAAAGC GATTCTGGAT GACATGATAG AAGCTAACCG GGACTACTGG
CCTGCGCTGC AAAAAGCCTG GCAGGACGGC GAAGCGGTGA AAAAATAA
 
Protein sequence
MFKPPFILSI AGGGSTYTPG IVKSLMVQLQ DFPLAEIRLY DIDAARQNTI APVVEKVIRD 
HSQSIIFTVT DDPEVAFSGA HFVFAQMRVG QYKMREQDEK IPLRHGVVGQ ETCGPGGLAY
GLRTILPMVE LIDLVERFAH EKAWIVNYSN PAAIVAEGVR RLRPNARVLN ICDMPVAAMR
NMGAILGVDR HKLEVDYFGL NHFGWFTRVM VDGVDRLPEL RSHIAKFGLL TEDAAKTDPQ
HSDPSWVKTW RNIKPIMDNF PDYLPNPYLQ YYLMPNQIVE HQNPDYTRAN EVMNGREKKL
FAAAEEYKRT GILSDAFHVG VHGEFIVNVA RSLAFNLRQR HLVMVENRGA ITNLPYDAVV
EVPAYITSEG PEPIRVGQVP LFHQTLLQQQ LASEQLLVEA TVEGSYEKAL QAFTLNRTVP
TMEHAKAILD DMIEANRDYW PALQKAWQDG EAVKK