Gene Ent638_2355 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_2355 
SymboltreA 
ID5111019 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp2543355 
End bp2545061 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content57% 
IMG OID640492538 
Producttrehalase 
Protein accessionYP_001177075 
Protein GI146312001 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1626] Neutral trehalase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.367107 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATAA TTGATCTGCG CCGTCCTGCG GCTTTACCGC TGGCGCTGCG CGTCGCATTG 
GGAGGCGCAC TGCTGGGTGT GGCTGCGTTC AGCCATGCCG AAGAGGCACA ATCGACGCAA
AAGCAGTCGC CGGATATTCT GCTGGGTCCC CTGTTCGTCG ACGTGCAGAG CGCAAAATTA
TTCCCTGATC AAAAAACCTT TGCCGACGCG GTACCAAAAA GCGATCCCTT GATGATCCTG
GCAGATTACC GGATGCAGCA TAAACAGTCC GGCTTTGACC TGCGCCACTT TGTCGAGATG
AACTTCACCC TGCCGGGCGA AGGGGAAAAA TATGTTCCCC CCGCCGGACA GAACTTACGC
GAACATATTG ACGGGTTATG GCCGGTGCTG ACACGCACCA CCGACAAGGC CAGCAAATGG
GATTCACTGC TTCCGTTGCC AAAACCCTAT GTCGTGCCCG GCGGGCGCTT CCGTGAGGTC
TATTACTGGG ACAGCTACTT CACCATGCTG GGCCTGGCCG AGAGCGGCCA TTGGGACAAG
ATCAGCGATA TGGTGGCGAA CTTTGGCTAC GAACTTGACT CGTGGGGCCA CATTCCTAAT
GGGAACCGCA GCTATTATCT GAGCCGCTCT CAGCCGCCTT TCTTCTCGCT GATGGTTGAG
CTGCTGGCGA CGCATGATAA AGAGGCGCTG AAAACCTATC GCGCGCAGAT GGAAAAAGAG
TACGCCTACT GGATGGAGGG CGCGGAGACA TTGCAGCCGG GACAGGCGAA CAAACGGGTG
GTGAAACTGG ATGATGGCTC GATTCTGAAC CGCTACTGGG ATGACCGCGA TACGCCCCGC
CCTGAATCCT GGCTGGACGA CGTGACCACC GCTAAAAATA ATCCGAACCG TCCGGCGACG
GAGATCTATC GCGATCTGCG ATCTGCGGCG GCGTCTGGCT GGGACTTTAG CTCCCGCTGG
ATGGACGATC CGCAAAAGCT GGGGACGATC CGCACCACCA GCATCGTGCC TGTCGACCTG
AACGCCCTGA TGTTCAAGAT GGAAAAACTG CTGGCAAAAG CCAGCCAGGA ATCGGGTGAT
GCTGCTGCAA CCAGCAAGTA TGAAACCCTG GCGACGTCTC GTCAAAAAGC CATGGAAAGC
CATCTCTGGA ATGAAAAAGA GGGCTGGTAC GCCGATTACG ATCTGAAAAG CAAAAAGGTG
CGTAATCAGC TGACTGCCGC CGCGCTGTTC CCGCTGTATG TGAACGCCGC ATCAAACGAC
CGTGCTGCGA AAGTCGCCAG CGCGACGGCC TCGCGTTTGC TAAAACCGGG TGGGATCTCG
ACGACCACCA TCAATAGCGG CCAGCAGTGG GATGCGCCAA ACGGCTGGGC ACCTTTACAA
TGGGTTGCCA CTGAAGGATT GCAAAACTAC GGTCATGAAA AAGTGGCGAT GGATGTGACC
TGGCGCTTCC TCACCAACGT TCAGCACACC TACGATCGTG AGCAAAAACT GGTCGAGAAA
TATGACGTTT CTACCACCGG GACGGGCGGC GGCGGAGGCG AGTATCCGTT GCAGGATGGA
TTCGGCTGGA CCAACGGCGT GACCTTAAAA ATGCTGGATC TTGTGTGCCC GAAAGAGAAA
CCGTGCGACA GCGTACCGGC CTCGCAACCG GCGGCGAATG ATGATGTGGC TCCTCAATCG
TCAACGGAGA AGAGTGCGGC GCAGTAG
 
Protein sequence
MTIIDLRRPA ALPLALRVAL GGALLGVAAF SHAEEAQSTQ KQSPDILLGP LFVDVQSAKL 
FPDQKTFADA VPKSDPLMIL ADYRMQHKQS GFDLRHFVEM NFTLPGEGEK YVPPAGQNLR
EHIDGLWPVL TRTTDKASKW DSLLPLPKPY VVPGGRFREV YYWDSYFTML GLAESGHWDK
ISDMVANFGY ELDSWGHIPN GNRSYYLSRS QPPFFSLMVE LLATHDKEAL KTYRAQMEKE
YAYWMEGAET LQPGQANKRV VKLDDGSILN RYWDDRDTPR PESWLDDVTT AKNNPNRPAT
EIYRDLRSAA ASGWDFSSRW MDDPQKLGTI RTTSIVPVDL NALMFKMEKL LAKASQESGD
AAATSKYETL ATSRQKAMES HLWNEKEGWY ADYDLKSKKV RNQLTAAALF PLYVNAASND
RAAKVASATA SRLLKPGGIS TTTINSGQQW DAPNGWAPLQ WVATEGLQNY GHEKVAMDVT
WRFLTNVQHT YDREQKLVEK YDVSTTGTGG GGGEYPLQDG FGWTNGVTLK MLDLVCPKEK
PCDSVPASQP AANDDVAPQS STEKSAAQ