Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ent638_3939 |
Symbol | |
ID | 5111591 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | - |
Start bp | 4261018 |
End bp | 4263300 |
Gene Length | 2283 bp |
Protein Length | 760 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640494148 |
Product | cellulose synthase regulator protein |
Protein accession | YP_001178645 |
Protein GI | 146313571 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0298213 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCATC TGACCAAACT GACTTTGCTG GCGGCTTCCT TATTTGCGCT GCCGCTGCAT GGCGAAGAAA CCTCTGTTGA GGCGCTCTTG CCGGTAGAAA TGCCCGCGCC GACCGCGCCA GTGCTGGCCA ATTTTGTGCC GTCCACGGTA AACAGTATCA GCGTGGCGCA GATGGGCCAG CCGCAGGGCC TGGTGCTCAG CGGCGGCCAG TTACAGGGCG GGATGAACTT CACTCTGCCG GTCGATCAGG TCATCACCAA CGCGCAGTTG TCGCTAAACC TGAAAGTCTC TCCGGCAATG GCAACGCGCA ACGCCACCAT GCAGCTTATG CTCAACGGCC AGCCGCTCGG CACCGTGCCG CTGGGTGCGG CGGACAGCGA CGTGTCTCGC TTCCAGCTGG ATATCCCCGC TGCGCTGTTG GTATCCAGCA ACAGCCTGAG CTTTAAGATT AACGATGGCG ATGCGATGCA GTGCCAGCGC GATCTGTCGG AAAAATACCG CGTCACCATC CTGCCGGATT CTCGCTTTGA TTTGGAAGGC CAGCAGCTTG ATATCGGCGC GGATCTCAGC CATTTCCCAC GCCCGTTCTT TGATTCGATG CAGATGACAC CCGCCACGAT TGCTTTCGCT TTTGGGCCAA CGCTCAGCAG CGATACGCTC GGTGCAGCGG CATCGATCTC CTCCTGGCTC GGCATTCAGG CGGATTATCG CGGCGTCTCA TTTGCCGCGC TGCACGACAC GCTACCCGAG AAAAATGGCA TTCTGATTGG CCATCCGGGC GAGAAAATCG GCGGCCTGAC GCTGCCGCAG TCTTCTCAGC CGATGCTGCA GATTATCGAT AACCCGGCGA ACCCGGTTTA TAAACTGCTG CTGATCGTCG GTAACGATGA AGCCGCGCTG CGCACCGCCG CATGGCGACT GACGCGGGGC AATTTCACGC AGCAAACCGC CAGCATGGCG ATCCCCACGC AGACGATCCC GGCCAGCAAA CCTTACGACG CGCCGCGCTG GATCCCAACC GATCGTCCGG TGAAACTCTC AGAGCTGATC CGCAAGGATC AAAGCCTGAC CGTCACCGGG ATCTGGCATG CGCCGCTGCG CGTGGCATTC CGCGCGGCCC CGGATCTGTT CCTGTGGGAT GGCGAAACCA TTCCGCTGCA TATCGGCTAC CGTTTCCCGA CCGAAAGTTG GATCGACGAG AACAAGTCCT GGCTCAGCAT GACCATGAAC GACACATTCC TGCACAACCT GCCGGTCAAC AAACAGGGCG CGCTGGAAAC GCTGTGGCAT AAAGTGGGTG GCGACGCGCG GCAGGAGAAG TTCGACATGC CGCTGGAGCC ATACATGATT TACGGTGACA ACCAGCTGTC GCTCTATTTC AACATCACGG CGAAAGAGAA CGCGCCGTGT AGCGTGCTGA TGAACAACAA CATCAAGAGC CGCATCGACG AAGATTCGTG GATCGATCTG AGCCACACGC GCCACTTCTC GCTCTTGCCG AATCTTTCCT ATTTTGTCGG CGCATCCTTC CCGTTCACGC GTCTGGCCGA TTATTCGCAA ACCGTTCTGC TGCTGCCTGA GCAGCCGACC GAAACGCAAA TCGCCACGCT GCTGGATATG GCGGCGCGTT CCGGCAATGC CACCGGCACG GCGCTGTACA ACAACCGCGT AGTGCTCGGC GTACCGACGG CGGGTGGTAA CCTCGAACTG CTGCGCACGC GGGATGTTCT GGCGGTCAGC GGCATGGCGC AGCACGACTT CAACCAGGCA CTGTTGAGCG GTTCACCGTT TACTTCCCAT GACAACACGC TGGGCGTCCG CACCCCGTCA ACGTGGCAGA AACTGCAACG CTGGCTGGCG GGCGACTGGA CATCTGATGG TGTCGAAGCC GACCGCTATT TCTCCTCGAA CGAAGCGTGG CGCGGGTTTG TCAGCTTCCG TTCCCCGTGG AGTAGCGATC GTCTGGTGGT GATGGCGCTC GGCAGTAACG ACGAGCAGTT AGCCCGTCTG CATGACGATT TAACTTCTGC GCGCATCAAC GCTGGGATTC GCGGTGACGC GGCCATTATC ACCAACGAAA ACGGCGTACG CAGCTTCCGC GTGGGGTCGC AGTTCCCGAG CGGCGAAATG CCGACCCAGA TGATGATTGT CTGGTACGCC AACCAGCACT CGGCGCTGCT GGCGATTCTG GGGCTGATCA TGAGTACGCT TTGCGGTCTG GGGCTGTATG CCTGGCTGAA AAAGCGCGCG CGTAAGCGTT TGAATCCGGA GACAGGAAAG TGA
|
Protein sequence | MKHLTKLTLL AASLFALPLH GEETSVEALL PVEMPAPTAP VLANFVPSTV NSISVAQMGQ PQGLVLSGGQ LQGGMNFTLP VDQVITNAQL SLNLKVSPAM ATRNATMQLM LNGQPLGTVP LGAADSDVSR FQLDIPAALL VSSNSLSFKI NDGDAMQCQR DLSEKYRVTI LPDSRFDLEG QQLDIGADLS HFPRPFFDSM QMTPATIAFA FGPTLSSDTL GAAASISSWL GIQADYRGVS FAALHDTLPE KNGILIGHPG EKIGGLTLPQ SSQPMLQIID NPANPVYKLL LIVGNDEAAL RTAAWRLTRG NFTQQTASMA IPTQTIPASK PYDAPRWIPT DRPVKLSELI RKDQSLTVTG IWHAPLRVAF RAAPDLFLWD GETIPLHIGY RFPTESWIDE NKSWLSMTMN DTFLHNLPVN KQGALETLWH KVGGDARQEK FDMPLEPYMI YGDNQLSLYF NITAKENAPC SVLMNNNIKS RIDEDSWIDL SHTRHFSLLP NLSYFVGASF PFTRLADYSQ TVLLLPEQPT ETQIATLLDM AARSGNATGT ALYNNRVVLG VPTAGGNLEL LRTRDVLAVS GMAQHDFNQA LLSGSPFTSH DNTLGVRTPS TWQKLQRWLA GDWTSDGVEA DRYFSSNEAW RGFVSFRSPW SSDRLVVMAL GSNDEQLARL HDDLTSARIN AGIRGDAAII TNENGVRSFR VGSQFPSGEM PTQMMIVWYA NQHSALLAIL GLIMSTLCGL GLYAWLKKRA RKRLNPETGK
|
| |