Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ent638_3996 |
Symbol | |
ID | 5110461 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | - |
Start bp | 4332644 |
End bp | 4333894 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640494214 |
Product | polysaccharide biosynthesis protein |
Protein accession | YP_001178702 |
Protein GI | 146313628 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0920288 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.168346 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCTGG CGAAAGCATC GGTGTGGACC GCCGCGTCCA CGCTCGTAAA GATTGGCACC GGGCTGTTAG TGGTTAAACT TCTGGCCGTC TCGTACGGCC CCTCAGGTGT TGGTCAGGCC GGCAATTTCC GCCAGCTTGT GACCGTGCTT GGGGTTCTCG CAGGTGCCGG TATTTTCAAC GGCGTGACCA AATACGTTGC ACAGCATCAT GACGATACCG CATCGCTTCG CAAGGTGATC GGTACCTCGT CCGCGATGGT ATTGGGTTTC TCGACGTTGC TGGCGGTTGT ATTTCTTCTT GCGGCGGCCC CAATCAGTCA GGGGCTTTTC GGCAACACCC ATTATCAGGG CCTGGTGCGC CTGGTTGCGC TGGTGCAGAT GGGCATTGCC TGGGCCAACC TGCTGTTAGC CTTAATGAAG GGTTTTCGGG ATGCGGCCGG GAACGCGCTG GCGCTGATTG CGGGCAGTTT TATTGGCGTC ATCGCCTATT ACTTTTGCTA TCGTCTGGGC GGCTACGAAG GCGCATTGCT TGGCCTGGCG CTGGTTCCCG CGTTGGTCGT GATCCCCGCT GCGTTGATGT TGATGCGCAG ACGCACGATT CCGCTAAGCT ATCTCAAACC GCAGTGGGAC AAAATTCTGG CGGGGCAATT GGGGAAATTT ACCCTGATGG CACTCATCAC ATCCGTCACG TTACCCGTGG CCTACGTGAT GATGCGAAAC CTGCTGGCGG CGCACTACAG CTGGGATGAA GTGGGGATCT GGCAAGGTGT GAGCAGTATT TCTGACGCCT ATCTCCAGTT TATCACTGCG TCTTTTAGCG TTTATTTGCT GCCAACCTTG TCGCGCCTGG TGTCAAAACA GGACATTACC CGCGAGATTG GCCGCTCTCT GCGTTTTGTT CTTCCTGCCG TGGCTGTCGC GAGTTTGACC GTCTGGTTGC TGCGAGATGT AGCCATCTGG CTGCTGTTCT CGGCAAAATT TACCGCGATG CGCGATCTGT TTGCCTGGCA ACTGGTGGGC GATGTACTGA AAGTGGGGGC TTACGTTTTT GGCTATCTGG TGATTGCTAA AGCGTCGCTG CGCTTGTACA TCCTGGCGGA AATCAGCCAG TTTTCGCTCT TAACCGCTTT CTCTCTTTGG CTGATCCCTG CGCACGGCGC GCTGGGGGCA TCACAGGCCT ATATGGCGAC TTACATCGTT TATTTCGCTG CCTGTTGCGG CGTATTTTTA CTTTGGCGTA AACGCGCATG A
|
Protein sequence | MSLAKASVWT AASTLVKIGT GLLVVKLLAV SYGPSGVGQA GNFRQLVTVL GVLAGAGIFN GVTKYVAQHH DDTASLRKVI GTSSAMVLGF STLLAVVFLL AAAPISQGLF GNTHYQGLVR LVALVQMGIA WANLLLALMK GFRDAAGNAL ALIAGSFIGV IAYYFCYRLG GYEGALLGLA LVPALVVIPA ALMLMRRRTI PLSYLKPQWD KILAGQLGKF TLMALITSVT LPVAYVMMRN LLAAHYSWDE VGIWQGVSSI SDAYLQFITA SFSVYLLPTL SRLVSKQDIT REIGRSLRFV LPAVAVASLT VWLLRDVAIW LLFSAKFTAM RDLFAWQLVG DVLKVGAYVF GYLVIAKASL RLYILAEISQ FSLLTAFSLW LIPAHGALGA SQAYMATYIV YFAACCGVFL LWRKRA
|
| |