Gene Ent638_3939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_3939 
Symbol 
ID5111591 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp4261018 
End bp4263300 
Gene Length2283 bp 
Protein Length760 aa 
Translation table11 
GC content60% 
IMG OID640494148 
Productcellulose synthase regulator protein 
Protein accessionYP_001178645 
Protein GI146313571 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0298213 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCATC TGACCAAACT GACTTTGCTG GCGGCTTCCT TATTTGCGCT GCCGCTGCAT 
GGCGAAGAAA CCTCTGTTGA GGCGCTCTTG CCGGTAGAAA TGCCCGCGCC GACCGCGCCA
GTGCTGGCCA ATTTTGTGCC GTCCACGGTA AACAGTATCA GCGTGGCGCA GATGGGCCAG
CCGCAGGGCC TGGTGCTCAG CGGCGGCCAG TTACAGGGCG GGATGAACTT CACTCTGCCG
GTCGATCAGG TCATCACCAA CGCGCAGTTG TCGCTAAACC TGAAAGTCTC TCCGGCAATG
GCAACGCGCA ACGCCACCAT GCAGCTTATG CTCAACGGCC AGCCGCTCGG CACCGTGCCG
CTGGGTGCGG CGGACAGCGA CGTGTCTCGC TTCCAGCTGG ATATCCCCGC TGCGCTGTTG
GTATCCAGCA ACAGCCTGAG CTTTAAGATT AACGATGGCG ATGCGATGCA GTGCCAGCGC
GATCTGTCGG AAAAATACCG CGTCACCATC CTGCCGGATT CTCGCTTTGA TTTGGAAGGC
CAGCAGCTTG ATATCGGCGC GGATCTCAGC CATTTCCCAC GCCCGTTCTT TGATTCGATG
CAGATGACAC CCGCCACGAT TGCTTTCGCT TTTGGGCCAA CGCTCAGCAG CGATACGCTC
GGTGCAGCGG CATCGATCTC CTCCTGGCTC GGCATTCAGG CGGATTATCG CGGCGTCTCA
TTTGCCGCGC TGCACGACAC GCTACCCGAG AAAAATGGCA TTCTGATTGG CCATCCGGGC
GAGAAAATCG GCGGCCTGAC GCTGCCGCAG TCTTCTCAGC CGATGCTGCA GATTATCGAT
AACCCGGCGA ACCCGGTTTA TAAACTGCTG CTGATCGTCG GTAACGATGA AGCCGCGCTG
CGCACCGCCG CATGGCGACT GACGCGGGGC AATTTCACGC AGCAAACCGC CAGCATGGCG
ATCCCCACGC AGACGATCCC GGCCAGCAAA CCTTACGACG CGCCGCGCTG GATCCCAACC
GATCGTCCGG TGAAACTCTC AGAGCTGATC CGCAAGGATC AAAGCCTGAC CGTCACCGGG
ATCTGGCATG CGCCGCTGCG CGTGGCATTC CGCGCGGCCC CGGATCTGTT CCTGTGGGAT
GGCGAAACCA TTCCGCTGCA TATCGGCTAC CGTTTCCCGA CCGAAAGTTG GATCGACGAG
AACAAGTCCT GGCTCAGCAT GACCATGAAC GACACATTCC TGCACAACCT GCCGGTCAAC
AAACAGGGCG CGCTGGAAAC GCTGTGGCAT AAAGTGGGTG GCGACGCGCG GCAGGAGAAG
TTCGACATGC CGCTGGAGCC ATACATGATT TACGGTGACA ACCAGCTGTC GCTCTATTTC
AACATCACGG CGAAAGAGAA CGCGCCGTGT AGCGTGCTGA TGAACAACAA CATCAAGAGC
CGCATCGACG AAGATTCGTG GATCGATCTG AGCCACACGC GCCACTTCTC GCTCTTGCCG
AATCTTTCCT ATTTTGTCGG CGCATCCTTC CCGTTCACGC GTCTGGCCGA TTATTCGCAA
ACCGTTCTGC TGCTGCCTGA GCAGCCGACC GAAACGCAAA TCGCCACGCT GCTGGATATG
GCGGCGCGTT CCGGCAATGC CACCGGCACG GCGCTGTACA ACAACCGCGT AGTGCTCGGC
GTACCGACGG CGGGTGGTAA CCTCGAACTG CTGCGCACGC GGGATGTTCT GGCGGTCAGC
GGCATGGCGC AGCACGACTT CAACCAGGCA CTGTTGAGCG GTTCACCGTT TACTTCCCAT
GACAACACGC TGGGCGTCCG CACCCCGTCA ACGTGGCAGA AACTGCAACG CTGGCTGGCG
GGCGACTGGA CATCTGATGG TGTCGAAGCC GACCGCTATT TCTCCTCGAA CGAAGCGTGG
CGCGGGTTTG TCAGCTTCCG TTCCCCGTGG AGTAGCGATC GTCTGGTGGT GATGGCGCTC
GGCAGTAACG ACGAGCAGTT AGCCCGTCTG CATGACGATT TAACTTCTGC GCGCATCAAC
GCTGGGATTC GCGGTGACGC GGCCATTATC ACCAACGAAA ACGGCGTACG CAGCTTCCGC
GTGGGGTCGC AGTTCCCGAG CGGCGAAATG CCGACCCAGA TGATGATTGT CTGGTACGCC
AACCAGCACT CGGCGCTGCT GGCGATTCTG GGGCTGATCA TGAGTACGCT TTGCGGTCTG
GGGCTGTATG CCTGGCTGAA AAAGCGCGCG CGTAAGCGTT TGAATCCGGA GACAGGAAAG
TGA
 
Protein sequence
MKHLTKLTLL AASLFALPLH GEETSVEALL PVEMPAPTAP VLANFVPSTV NSISVAQMGQ 
PQGLVLSGGQ LQGGMNFTLP VDQVITNAQL SLNLKVSPAM ATRNATMQLM LNGQPLGTVP
LGAADSDVSR FQLDIPAALL VSSNSLSFKI NDGDAMQCQR DLSEKYRVTI LPDSRFDLEG
QQLDIGADLS HFPRPFFDSM QMTPATIAFA FGPTLSSDTL GAAASISSWL GIQADYRGVS
FAALHDTLPE KNGILIGHPG EKIGGLTLPQ SSQPMLQIID NPANPVYKLL LIVGNDEAAL
RTAAWRLTRG NFTQQTASMA IPTQTIPASK PYDAPRWIPT DRPVKLSELI RKDQSLTVTG
IWHAPLRVAF RAAPDLFLWD GETIPLHIGY RFPTESWIDE NKSWLSMTMN DTFLHNLPVN
KQGALETLWH KVGGDARQEK FDMPLEPYMI YGDNQLSLYF NITAKENAPC SVLMNNNIKS
RIDEDSWIDL SHTRHFSLLP NLSYFVGASF PFTRLADYSQ TVLLLPEQPT ETQIATLLDM
AARSGNATGT ALYNNRVVLG VPTAGGNLEL LRTRDVLAVS GMAQHDFNQA LLSGSPFTSH
DNTLGVRTPS TWQKLQRWLA GDWTSDGVEA DRYFSSNEAW RGFVSFRSPW SSDRLVVMAL
GSNDEQLARL HDDLTSARIN AGIRGDAAII TNENGVRSFR VGSQFPSGEM PTQMMIVWYA
NQHSALLAIL GLIMSTLCGL GLYAWLKKRA RKRLNPETGK