Gene Ent638_3940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_3940 
Symbol 
ID5111592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp4263297 
End bp4265396 
Gene Length2100 bp 
Protein Length699 aa 
Translation table11 
GC content56% 
IMG OID640494149 
Productcellulose synthase (UDP-forming) 
Protein accessionYP_001178646 
Protein GI146313572 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID[TIGR03030] cellulose synthase catalytic subunit (UDP-forming) 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0284472 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGT TTCTATTTAT TTTGCTGGCC TTAGCAATGA TTCCTGTTGC CTTGCTCATC 
ATCATCACGC CGATGGACAG CCAGAAACAA TATATCTTTG GCTTTATTAG TATTGGCCTT
CTGTTTTTAA TGGGCTTTAG CAAAAGACGC AGTATTTCCG TGATTATGGT GGTGATGTCA
TTATTGATGT CGACCCGCTA TATGTATTTC CGCCTGACGC AAACCCTGCA TTTTAATTCT
GAGATTGAAA CAATACTCGG CATGGGGTTA TTTCTGGCGG AAGTCTACGT CTGGGTGCTT
TTGCTGCTCA ACTACCTGCA AACCGTCTGG CCGCTGAAGC GTGAAATTGT CCCGCTGCCC
GACGACATGT CGAAATGGCC GACGGTTGAC GTCTACATCC CAACCTATAA CGAATCGCTG
GACGTGGTGC GTGATACGGT GCTGGCCGCG CAGTGTATCG ACTACCCGCG CGACAAAATG
AAAATCTACC TGCTCGATGA CGGGAAGCGC CGTGAGTTTG CCGTCTTCGC GGCAGACGTT
GGTGTGGGTT ATATCACCCG TAACGACAAC TCCCACGCCA AAGCGGGTAA CCTGAACCAC
GCGATGAAGC TGACGCAGGG CGAGCTGATC ACGGTATTTG ACTGTGACCA CGTCGCCACG
CGTATTTTCC TGCAAGCCAC TGTGGGCGGG TTCCTGAAAG ATCCCAAGCT GGCGCTGGTG
CAAACGCCGC ACTATTTCTA TTCGCCAGAT CCGTTCGAAC GCAACCTCTC CGTCGGGCGA
AATATCCCGA ATGAAGGGAC GCTGTTCTAC GGCCCGATTC AGCAGGGCAA TGACAACTGG
AACGCCACGT TCTTCTGCGG CTCCTGTGCG GTTATCCGCC GTAGCGCCTT GGAAGAGATC
GGCGGCTTCG CCGTGGAAAC CGTCACCGAA GATGCGCACA CCGCGTTGAA AATGCAGCGC
CTCGGCTGGA AATCCGCCTT CCTCGATATT CCCCTGGCCG CCGGTCTTGC CACCGAACGC
CTGGTTCTGC ACGTCATCCA GCGTACCCGC TGGGCGCGCG GCATGACGCA AATTTTCCGT
CTGGATAACC CACTCTTAGG CCGTGGGCTG ACGATTCAGC AGCGCCTGTG CTACCTCAGC
GCCATGCTCT ATTATCAGTT CGCACTGCCG CGCATCGCGT TCTTAACCGC GCCGCTGGCC
TATCTGCTGT TCAACCTCAA CATTATTCAC TCGTCGGCGG GCCTGGTGTT TGCCTATGCG
CTGCCGCACC TGTTCCTGGC GATTTACCTC AACTCGCGAA TGAACGGCCG CTATCGCTAC
AGCTTCTGGG GTGAAATCTA CGATCTGGTG CTGGCATTCC ACCTGGTATT GCCAACCGCC
GTGACCATGA TTTTCCCGAA GCGCGGCAAG TTCAACGTGA CGGATAAAGG CGCGCTGCTT
AACGTCGGCT ACTTCGATTT CAGGGTCGTG CGCCCGCATC TGGTGATCGC CATTCTGCTG
GCCATCGGCG TTATTGCCGG TATTGTTCGC GCATGCGCCC ACGACTACTA CGGCGTCGAT
CCCAGCGTTA TCGCGCTCAA TGTCGGTTGG GGGCTTTACA GCCTGATCTT CCTGCTGGCG
GCGATTGCCG TCGCCCGTGA AACGCGCCAG ACGCGCAAAA CTATTCGTAT CGACGTCAAA
ATCCCGGTGC TGATTCACTA CGCCAGCGGG ATCTCTTCCC GCAGCCAGAC GGCGGATTTA
TCGATGGGCG GTTGCCGTAT CGAACTACCG GATGAACGCC ATCTGACGGA TGAAATCGAA
GAAGTGGAAC TCCTGCTGCA ATCGGGGGCG ATCAGCATCC CGGTGAAAGT GGTCGCCACA
GACGAAGAGT ATATCCGCCT GATGTTTGAA GATATTCCGC TGGCACGCCG CCGTGAACTG
GTGCGTGTGG TGCTGGCTCG CGCTGATGCG TGGATCCAGC CGCCAAAACC GCAGGATAAT
CCGTTCCGTT CGCTGCTGAC GATCGTGCGC TGCGTATTTG ATCTCTTCTG GCTGACGTGG
AAAACGCGCC GGGAGAACCG CCGCGCGCAA GCGCAGGTGC AGGAGGACGG CAACGCATGA
 
Protein sequence
MKKFLFILLA LAMIPVALLI IITPMDSQKQ YIFGFISIGL LFLMGFSKRR SISVIMVVMS 
LLMSTRYMYF RLTQTLHFNS EIETILGMGL FLAEVYVWVL LLLNYLQTVW PLKREIVPLP
DDMSKWPTVD VYIPTYNESL DVVRDTVLAA QCIDYPRDKM KIYLLDDGKR REFAVFAADV
GVGYITRNDN SHAKAGNLNH AMKLTQGELI TVFDCDHVAT RIFLQATVGG FLKDPKLALV
QTPHYFYSPD PFERNLSVGR NIPNEGTLFY GPIQQGNDNW NATFFCGSCA VIRRSALEEI
GGFAVETVTE DAHTALKMQR LGWKSAFLDI PLAAGLATER LVLHVIQRTR WARGMTQIFR
LDNPLLGRGL TIQQRLCYLS AMLYYQFALP RIAFLTAPLA YLLFNLNIIH SSAGLVFAYA
LPHLFLAIYL NSRMNGRYRY SFWGEIYDLV LAFHLVLPTA VTMIFPKRGK FNVTDKGALL
NVGYFDFRVV RPHLVIAILL AIGVIAGIVR ACAHDYYGVD PSVIALNVGW GLYSLIFLLA
AIAVARETRQ TRKTIRIDVK IPVLIHYASG ISSRSQTADL SMGGCRIELP DERHLTDEIE
EVELLLQSGA ISIPVKVVAT DEEYIRLMFE DIPLARRREL VRVVLARADA WIQPPKPQDN
PFRSLLTIVR CVFDLFWLTW KTRRENRRAQ AQVQEDGNA