Gene Ent638_4001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_4001 
Symbol 
ID5110466 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp4338151 
End bp4339197 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content51% 
IMG OID640494219 
Productlipopolysaccharide biosynthesis protein WzzE 
Protein accessionYP_001178707 
Protein GI146313633 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3765] Chain length determinant protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.641375 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0363623 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCAAC CGTTGGCAGG ATCGAAATCA GTAGTAACTG AGAATGAACT GGATATTCGT 
GGTTTGTTTC GTGTTTTGTG GGCGGGCAAG CTTTGGATTG CAGGAATTGC GCTGGCGTTC
GCGCTGCTCG CGTTGGCGTA TACCTTTTTT GCAAAACAAG AATGGAGCGC GACAGCGATT
ACTGACCGTC CAACGGTGAA TATGTTGGGT GGTTTTTACT CCCAGCAGCA ATTTCTGCGC
AATCTCGACA TCAAGGCAAG TCTTGCGTCT ACCGATCAAC CTTCGGTGAT GGATGAGTCT
TACAAAGAGT TCATCATGCA GTTGGCTTCA TGGGATACCC GTCGTGATTT TTGGTCGCAG
ACGGATTATT ACAAGCAACG CATGGTCGGT AACAGCAAAG CGGATGCCGC GCTTCTTGAT
GATATGATTG ATAACATCCA GTTTATGCCT GGCGATGCTA TACGTAACAT CAATGATAAC
GTGAAGCTGA TCGCTGAAAC CGCGCCAGAC GCCAACAATC TGCTGCGTCA GTACGTCGCT
TTTGCCAGCC AGCGGGCGGC CAGTCATCTG AACGATGAGC TCAAAGGTGC CTGGGCTGCG
CGAACTATCC AGATGAAAGC GCAGGTCAAA CGCCAGGAAG AAGTGGCTAA AACGATTTTC
TCCCGCCGTG TACACAACAT CGAACAGGCT CTGAAAATTG CTGAACAACG CAATATTTCT
CGCAGTGAAA CCGATGTACC TGCCGACGAA TTACCAGATT CAGAGATGTT CCTGCTGGGA
CGCCCTATGC TTCAGGCTCG CCTGGAGAAT TTGCAGGCCG TTGGGCCCGA TTTTGACCTC
GATTACGATC AAAACCGCGC CATGCTGAAC ACGCTGAATG TAGGGCCTAC GCTCGACCCA
CGTTTTCAGA CCTATCGTTA TTTGCGAACG CCTGAAGAAC CTGTAAAACG CGATAGTCCG
CGTCGCGCAT TCCTGATGAT TATGTGGGGT ATTGTGGGCG CGCTGATTGG CGCAGGGGTG
GCACTCACGC GTCGTCGCAC AATTTAA
 
Protein sequence
MTQPLAGSKS VVTENELDIR GLFRVLWAGK LWIAGIALAF ALLALAYTFF AKQEWSATAI 
TDRPTVNMLG GFYSQQQFLR NLDIKASLAS TDQPSVMDES YKEFIMQLAS WDTRRDFWSQ
TDYYKQRMVG NSKADAALLD DMIDNIQFMP GDAIRNINDN VKLIAETAPD ANNLLRQYVA
FASQRAASHL NDELKGAWAA RTIQMKAQVK RQEEVAKTIF SRRVHNIEQA LKIAEQRNIS
RSETDVPADE LPDSEMFLLG RPMLQARLEN LQAVGPDFDL DYDQNRAMLN TLNVGPTLDP
RFQTYRYLRT PEEPVKRDSP RRAFLMIMWG IVGALIGAGV ALTRRRTI