Gene Ent638_3891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_3891 
Symbol 
ID5110618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp4198480 
End bp4200171 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content59% 
IMG OID640494100 
Productglycosyl transferase family protein 
Protein accessionYP_001178597 
Protein GI146313523 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis
[COG4261] Predicted acyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.845308 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTTAG CGTTTCGCCC CTGCGTGGTG ATCCCCTGCT ACAACCACGG GGCGATGATG 
GCAAACGTGC TGTCGCGTCT TGCCCCCTTC GGCCTGCCGT GCCTGATTGT GGATGACGGC
AGCGATGAAG GAACACGTCT GGAGTTGGAG CGTCTTGCCG CCGAACAACC GCAGGTCACG
CTTATCCGAC TGGCTGAAAA CGCCGGGAAA GGCACAGCAG TGATCCGTGG GCTGGAAGAG
TCCGCTCACG CGGGCTTTAC CCATGCGGTG CAGGTGGATG CCGACGGTCA ACACGCGATT
GAAGATATCC CGAAGTTGCT GGCGCTGGCA GAACGCCATC CGGAGGCACT CATTTCAGGC
CAGCCGATTT ATGACGATTC CATTCCACGT TCGCGGCTGT ATGGCCGCTG GGTCACCCAT
TTCTGGGTGT GGATCGAAAC GTTATCGTTC CAGTTAAAAG ACAGCATGTG CGGGTTCCGC
GTCTATCCCG TCGCCCCTAC GCTGCAACTG GCTGAGCGCG TGACGCTCGG CAAACGAATG
GATTTTGATA CAGAAGTGAT GGTGCGGCTC TACTGGCAAG GCAACACCAG CCATTTTCTG
CCGACGCGCG TGACGTATCC GCTCGACGGC CTGTCGCATT TCGACGCACT GAAAGATAAC
GTGCGCATCT CCCTGATGCA CACCCGCCTG TTCTTCGGCA TGTTGCCGCG CATTCCTGTT
CTGCTGATGC GTCGTCGCCA GCAGCACTGG GCGCAGCAGC AGGAAGTAAA AGGCCTGTGG
GGTATGCGGC TAATGCTGCG CGTCTGGCAA CTGCTGGGAC GCCGCGCGTT TGGTCTCCTG
CTATGGCCCG TTATCGGCGT CTACTGGCTG ACTGCACGTC CGGCACGCCA GGCGTCACAG
CAGTGGATTG AGCGTGTGAA GCGGGTGCTG GCACAGCGCC AATTGCCCAC ACCGCCACGC
CTTAACAGCT TTTTCCACTT TATGCGTTTT GGCTATTCGA TGCTGGATAA AGTCGCCAGC
TGGCGCGGTG AGCTGAAACT CAATCGGGAT GTGGTGTTCG CCCCCGGTGC GCGCGAGGCG
CTGGACGTTG ATGCGCCGCA AGGAAAATTA CTGCTGGCTT CGCATCTAGG CGATGTAGAA
GCTTGCCGCG CTATGGCGCA GTTGGATGGC AGCAAGGTCA TTAACGCGCT GGTATTTAGC
GAAAACGCCC AGCGTTTTAA GCAAATCATG GAAGAGATGG CGCCTGACGC AGGCCTTAAC
CTGATGCCCG TCACGGATAT CGGTCCGGAC ACGGCAATCG CCCTGAAAGA AAAACTCGAT
CGCGGCGAAT GGGTAGCGAT CGTCGGCGAT CGCATCGCGG TCAAACCGCA GCGCGGCGGC
GACTGGCGCG TCATCTGGAG CGAATTTATG GGCCAGCCCG CACCGTTCCC TCAGGGGCCG
TTTATTCTCG CGTCGATTCT GCGCTGCCCG GTGCTGCTGA TTTTCGCCCT GCGCCAGCAG
GACAAACTAC ACATTCACTG TGAGCCGTTT GCCGACCCGC TGATTTTACC GCGCGGTGAA
CGCCAGCAAG CGCTGCAGCG TACCGTCGAT CGCTATGCCG AACGGCTGGA GCATTACGCA
CTGATGTCGC CGCTCGACTG GTTTAATTTT TTCGATTTCT GGCATCTGCC GGATGCCAAA
GAGAAGGAGT AA
 
Protein sequence
MSLAFRPCVV IPCYNHGAMM ANVLSRLAPF GLPCLIVDDG SDEGTRLELE RLAAEQPQVT 
LIRLAENAGK GTAVIRGLEE SAHAGFTHAV QVDADGQHAI EDIPKLLALA ERHPEALISG
QPIYDDSIPR SRLYGRWVTH FWVWIETLSF QLKDSMCGFR VYPVAPTLQL AERVTLGKRM
DFDTEVMVRL YWQGNTSHFL PTRVTYPLDG LSHFDALKDN VRISLMHTRL FFGMLPRIPV
LLMRRRQQHW AQQQEVKGLW GMRLMLRVWQ LLGRRAFGLL LWPVIGVYWL TARPARQASQ
QWIERVKRVL AQRQLPTPPR LNSFFHFMRF GYSMLDKVAS WRGELKLNRD VVFAPGAREA
LDVDAPQGKL LLASHLGDVE ACRAMAQLDG SKVINALVFS ENAQRFKQIM EEMAPDAGLN
LMPVTDIGPD TAIALKEKLD RGEWVAIVGD RIAVKPQRGG DWRVIWSEFM GQPAPFPQGP
FILASILRCP VLLIFALRQQ DKLHIHCEPF ADPLILPRGE RQQALQRTVD RYAERLEHYA
LMSPLDWFNF FDFWHLPDAK EKE