Gene Ent638_4068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_4068 
Symbol 
ID5110822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp4422984 
End bp4424243 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content59% 
IMG OID640494293 
ProductL-rhamnose isomerase 
Protein accessionYP_001178774 
Protein GI146313700 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4806] L-rhamnose isomerase 
TIGRFAM ID[TIGR01748] L-rhamnose isomerase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACTC AACTTGAACA AGCCTGGGAA CTGGCTAAAC AGCGTTTCGC CGCCGTCGGC 
GTAGATGTCG AAGAGGCGCT GCGTCAGCTC GATCGTCTGC CGGTTTCCAT GCACTGCTGG
CAAGGCGATG ACGTCGCAGG CTTTGAAAAT ACCGGCGCCG CGCTGACGGG GGGTATTCAG
GCCACCGGTA ACTATCCAGG TAAAGCACGT AACGCGACCG AGCTGCGCGC CGACCTGGAG
CTGGCGCTGA GCTTAATCCC CGGCCCAAAA CGCCTGAATC TGCATGCGAT TTACCACGAA
GCCCCGGAGC CGGTCGGGCG TAACGAAATC AAACCGGAAC ACTTTAAGAA CTGGGTTGAG
TGGGCAAAAG CCAACAAACT CGGGCTGGAT TTCAATCCGT CCTGCTTCTC GCATCCGCTG
AGCGCCGACG GTTTTACTCT GTCTCACGCA GATGATGAAA TTCGCCAGTT CTGGATCGAC
CACGTCAAGG CCAGCCGCCG CGTGTCCGCT TATTTTGGCG AACAGCTGGG CACGCCGTCG
GTGATGAACA TCTGGATCCC GGACGGCATG AAAGACATCA CCGTCGACCG TCTTGCCCCA
CGTCAGCGCC TGCTGGCCGC GCTCGATGAG ATCATCAGCG AAAAGATAAA TCCGGCGCAT
CACATTGATG CCGTGGAAAG CAAACTGTTC GGTATCGGCG CGGAGAGCTA CACCGTTGGC
TCAAATGAGT TCTACATGGG TTACGCCACC AGCCGCCAGA CCGCGCTGTG CCTGGATGCC
GGCCACTTCC ATCCAACGGA AGTGATCTCC GACAAAATCT CTGCCGCCAT GCTCTACGTG
CCGCGCCTGC TGCTGCACGT CAGCCGTCCG GTGCGTTGGG ACAGTGACCA CGTGGTGCTG
CTGGATGACG AAACCCAGGC CATCGCCAGC GAAATCATCC GTCACGATCT GTTCGACCGC
GTGCACATTG GGCTCGACTT CTTTGACGCC TCGATCAACC GCATCGCCGC GTGGGTGATT
GGTACGCGCA ACATGAAAAA AGCCCTGCTG CGCGCGCTGC TTGAGCCTAC GGCTGCACTG
AAGCAGCTCG AAGAAAACGG CGATTACACC GCACGTCTAG CACTGCTGGA AGAGCAAAAA
TCCCTGCCGT GGCAGGCCAT CTGGGAAATG TACTGCCAGC GCAACGACGC GCCAGCCGGT
AGCCAGTGGC TGGACAACGT GCGGGCATAT GAGAAAGAGG TGTTGAGCCA GCGCGGGTAA
 
Protein sequence
MTTQLEQAWE LAKQRFAAVG VDVEEALRQL DRLPVSMHCW QGDDVAGFEN TGAALTGGIQ 
ATGNYPGKAR NATELRADLE LALSLIPGPK RLNLHAIYHE APEPVGRNEI KPEHFKNWVE
WAKANKLGLD FNPSCFSHPL SADGFTLSHA DDEIRQFWID HVKASRRVSA YFGEQLGTPS
VMNIWIPDGM KDITVDRLAP RQRLLAALDE IISEKINPAH HIDAVESKLF GIGAESYTVG
SNEFYMGYAT SRQTALCLDA GHFHPTEVIS DKISAAMLYV PRLLLHVSRP VRWDSDHVVL
LDDETQAIAS EIIRHDLFDR VHIGLDFFDA SINRIAAWVI GTRNMKKALL RALLEPTAAL
KQLEENGDYT ARLALLEEQK SLPWQAIWEM YCQRNDAPAG SQWLDNVRAY EKEVLSQRG