Gene Ent638_3933 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_3933 
Symbol 
ID5111585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp4251669 
End bp4253231 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content50% 
IMG OID640494142 
Productputative cytoplasmic protein 
Protein accessionYP_001178639 
Protein GI146313565 
COG category 
COG ID 
TIGRFAM ID[TIGR03369] cellulose biosynthesis protein BcsE 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.147057 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.327931 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACCCCA TATTTTCAGT TGGTATCCAG TCATTGTGGG ATGAATTGCG CCACATGCCA 
GCCGGCGGAG TCTGGTGGAT TAGCACGGAT CGCAATGACG ATGCTATAAG TCTGGTGAAT
CAAACAATTG CAGCACAAGA TAAGGGCGCA AAAGTCGCCG TTGTCACTAT GGGTGAAGAC
CCTAAAAAAA TCATCAGACT CAATGAAACG CGCGGTCCCG ATAAAGTGCG TTTGTTTTCC
ATGCCCCATG AAGAAGATGG TCTATACTTT TTGCCCCGCG ATATTCAATG CAGTATTGAC
CCTGAACATT ATTTAGTGAT CCTCAAATGC ACAAATAATT TTTGGCAAAA TATATCTTCA
GAAAAATTGC GTCTGTGGCT GGAAAAGATC AATAAATGGG TGCGGGTTCA AAATTGTACG
CTGCTGGTAA TCAGCCCAGG CAGTAATAAT GATAAGCAGT TCTCATTTTT AATGAGTGAA
TATCGATCCC TTTTTGGTCT TGCCAGCCTC CGCCATCAGG CTGACAGCCA TCTTTACGAT
ATTGCTTTCT GGTGTAATGA AAAAGGCGTA AGCGCGCGGC AACAACTCAC CCTTATGCAT
AATAATGGCG AGTGGCATGT TGCGCGGCAA GAAGAAACGG TCGTTCAGCC GCGTAATGAT
GAAAAACGTA TTTTGAGTCA CATTGCGGTA TTAGAAGGTG CGCCTGCGCT TTCTGAATAT
TGGTCGCTGT TTGAAACCAA CGAAGGTTTG TTTAACGAGG CCCGCACGAC TCAGGCCGCC
ACGATTGTTT TTTCGCTCAC TCAGAATAAT CAAATTGAGG CGATTGCGCG GCAGATCCAT
ACCTTGCGCC GTCAGCGCGG AAGCGCGTTG AAAATCGTCG TGCGTGAGAA TACCACCAGC
CTGCGCGCCA CCGATGAGCG TCTGCTCCTG GGCTGTGGGG CAAACATGGT GATTCCATGG
AACGCGCCGC TTTCGCGCTG CTTAACGTTG ATCGAAAGCA TTCAGGGCCA GCAGTTTAAT
CGCTACGTCC CGGAGGATAT TTCGACGCTG CTTTCCATGA CCCAGCCGAT GAAACTGCGC
GGCTATCAGA AGTGGGACAC CTTCTGCGAA GCGGTCAGCA ATATGATGAG CAACACGCTG
TTGCCAGAAA ATGGTAAAGG CGTGATGGTC GCGCTGCGCC CGGTTCCGGG CATTCGTATT
GAACAGGCGC TGACGCTGTG CCGTCCAAAC CGTACCGGCG ATATCATGAC CATTGGCGAT
AATCGTCTGG TGCTGTTTTT ATCCTTCTGT CGGGTTAACG ACCTGGATAC CGCGCTGAAC
CACATATTCC CGCTGCCGAC CGGCGATATC TTCTCTAACC GCATGATTTG GTTTGAAGAC
AACTTGATCA GCGCCGAAAT CGTACAGATG CAAACGCTGG AACCTGAGCA GTGGGGCAAA
CCGCTGCTGA TGGCGAGCGA TGCGAAACCC GTTCTGAATG CTACGCATGA CGGGCACGCC
TGGCGCCGTA CCCCTGAGCC GCTTCGTTTA CTGAACGATG CGGAAGAGAG AGCTTCATCA
TGA
 
Protein sequence
MNPIFSVGIQ SLWDELRHMP AGGVWWISTD RNDDAISLVN QTIAAQDKGA KVAVVTMGED 
PKKIIRLNET RGPDKVRLFS MPHEEDGLYF LPRDIQCSID PEHYLVILKC TNNFWQNISS
EKLRLWLEKI NKWVRVQNCT LLVISPGSNN DKQFSFLMSE YRSLFGLASL RHQADSHLYD
IAFWCNEKGV SARQQLTLMH NNGEWHVARQ EETVVQPRND EKRILSHIAV LEGAPALSEY
WSLFETNEGL FNEARTTQAA TIVFSLTQNN QIEAIARQIH TLRRQRGSAL KIVVRENTTS
LRATDERLLL GCGANMVIPW NAPLSRCLTL IESIQGQQFN RYVPEDISTL LSMTQPMKLR
GYQKWDTFCE AVSNMMSNTL LPENGKGVMV ALRPVPGIRI EQALTLCRPN RTGDIMTIGD
NRLVLFLSFC RVNDLDTALN HIFPLPTGDI FSNRMIWFED NLISAEIVQM QTLEPEQWGK
PLLMASDAKP VLNATHDGHA WRRTPEPLRL LNDAEERASS