Gene Ent638_3965 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_3965 
Symbol 
ID5114685 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp4296573 
End bp4298036 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content53% 
IMG OID640494179 
Productglycoside hydrolase family protein 
Protein accessionYP_001178671 
Protein GI146313597 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.300617 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAAAT CCCTCCCGTT CCCGCAAGGT TTTTTATGGG GCGGCGCAAT TGCCGCTAAT 
CAGGCCGAAG GGGCCTGGAA CGTTGATGGC AAAGGACCGT CGGTGGCGGA TGCCATCACC
TGGAAACCCA ATCTGTCGCT GAAAGATTAT GACGGCCACA TGGCGCTGAC GGATGAAAAT
ATTCAGGATG CGTTTGAAGG CAAAAACGAC ACACTTTACC CGAAACGTCG CGGCATCGAT
TTCTATCACC ACTATAAAGA CGATATCGCG CTGTTTGCCG AGATGGGCTT TAAAGTGCTG
CGCGTGTCCA TTGCCTGGTC ACGTATTTTC CCGGACGGCG AAGACGCGGC GCCGAATGAA
GCGGGCCTGC AATTTTACGA AGAGATGTTC CGTGAACTGC GTCGCCATCA CATCGAGCCG
CTGGTGACGC TTTCTCACTA CGAAATGCCG CTGGCGCTGA GTGAGCGATA TAACGGCTGG
GTGCACCGCA ACGTGGTGGA CGCGTTCGTG CGCTTCAGCA ATGTCTGCTT CGACCGCTAT
AAAGATCTGG TGCGCTACTG GCTCACGTTT AACGAAATCG ACAGCATCCA CCGCCACCCG
TTTACCACCG CCGGTATCCG CGAAGAGAAA AGCGCGCCGG GCAAAGCGAA ACAGGATATT
TATCAGGGGC TGCATCATCA GTTTGTCGCC TCGGCGCTGG TCACCCGTGA CTGCCACGCC
AAAATCCCTG GCAGCCAGGT CGGGTGTATG CTGACCAAAC TCACCACCTA TCCGCACAGC
TGCCGCCCGG AAGACGTTGA AGCGACGCTG AAAAAGAATC TCGAAAACTA TTTCTATGCG
GATGTGCAGG TCTTTGGGGA ATATCCGCCG CTGATCCTGC GCGATCTGGC GAGCCGCGAT
ATTCAGATTG AAATGCAAGC CGACGATCAG CGCATTTTAA AAGATCATAC CGTCGATTTC
GTCTCGTTCA GTTACTACAT GTCGCTGACC GAATCGACGC AGCCGGACGT GGAACGCATC
CCGGGTAACA CCATTCTTGG GGTGAAAAAC CCGTATCTGC CTGCGTCTGA ATGGGGCTGG
CAAATCGATC CGGTCGGGCT AAAAATTTCC CTGCTCGAAC TGTACGACCG TTACCAAAAG
CCGCTGTTTA TCGTTGAAAA CGGGCTGGGT GCGAAGGATA TCGTTGAAGA TGGCAAGATT
CACGACAGCT ACCGCATCGA CTATTTCCGC GCCCATTTCG AGCAAACTTT GGCGGCTATC
AATGAAGGGG TGGATGTGAT GGGATTCACC ACCTGGGGAT GCATCGACAT TATTAGCGCA
GGCACGTCCC AGATGTCCAA GCGCTATGGC TTTATCTATG TCGATCAGGA TGATGAAGGC
AACGGCACGT TAAAGCGCCT GAAAAAAGAT TCTTTTGGGT GGTATCAGAA AGTGATCGCC
AGCAATGGCG CTGACATGAG CTAA
 
Protein sequence
MDKSLPFPQG FLWGGAIAAN QAEGAWNVDG KGPSVADAIT WKPNLSLKDY DGHMALTDEN 
IQDAFEGKND TLYPKRRGID FYHHYKDDIA LFAEMGFKVL RVSIAWSRIF PDGEDAAPNE
AGLQFYEEMF RELRRHHIEP LVTLSHYEMP LALSERYNGW VHRNVVDAFV RFSNVCFDRY
KDLVRYWLTF NEIDSIHRHP FTTAGIREEK SAPGKAKQDI YQGLHHQFVA SALVTRDCHA
KIPGSQVGCM LTKLTTYPHS CRPEDVEATL KKNLENYFYA DVQVFGEYPP LILRDLASRD
IQIEMQADDQ RILKDHTVDF VSFSYYMSLT ESTQPDVERI PGNTILGVKN PYLPASEWGW
QIDPVGLKIS LLELYDRYQK PLFIVENGLG AKDIVEDGKI HDSYRIDYFR AHFEQTLAAI
NEGVDVMGFT TWGCIDIISA GTSQMSKRYG FIYVDQDDEG NGTLKRLKKD SFGWYQKVIA
SNGADMS