Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ent638_3935 |
Symbol | |
ID | 5111587 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | + |
Start bp | 4253415 |
End bp | 4255094 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640494144 |
Product | hypothetical protein |
Protein accession | YP_001178641 |
Protein GI | 146313567 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03368] cellulose synthase operon protein YhjU |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0965348 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTAATT CTACTTATAC TGCTTCGGCA CCTTCGCCGC TTTGGCAATA CTGGCGCGGC CTTTCCGGCT GGAACTTCTA CTTTCTGGTG AAATTTGGTT TGCTGTGGGC GGGCTATCTG AATTTCCATC CGCTGTTAAA CCTGGTGTTT ATGGCATTTT TGCTGATGCC AATCCCCAAC CTGAAGCTAC ACCGCCTGCG TCACTGGATA GCCATCCCGA TTGGTTTTGT GCTGTTCTGG CACGACACTT GGCTGCCCGG CCCGGAAAGT ATTATGAGCC AGGGTTCGCA GGTCGCGGGC TTTAGCGCGG GCTATATTCA GGATCTGGTC GAGCGGTTTA TTAACTGGCA AATGATCGGC GCGATTTTCG TGCTGTTCAT TGCGTGGCTT TTCCTGTCGC AATGGCTACG CATCACGGTG TTCGTGGTGG CCATCCTGAT CTGGCTGAAC GTGTTAACCC TGGCCGGTCC GAGCTTCTCG TTGTGGCCTG CTGGTCAAGC GACGGACACG GTCACCACCA CGGGCGGTAC GGCTGCGCCT ACTGTCGCCA CTGCCGGAGC TACGCCGGTC ATTGGTGATA TCCCGTCACA AACCGCGCCG CCGACCAACA CCAATCTCAA CGCCTGGCTT TCCAGTTTTT ATAACGCAGA AGCCAAGCGC CAGACCAAAT TCCCGGATGC GTTACCCGCC GATGCACAGC CGTTTGAACT GCTGGTGATT AACATTTGCT CCCTTTCGTG GGCGGACGTG GAAGCTGCCG GTTTGATGTC GCATCCGCTG TGGTCGCATT TCGATATTCA GTTCAAAGAC TTCAACTCGG CCACTTCGTA CAGCGGTCCG GCGGCGATTC GCCTGCTGCG CGCAAGCTGC GGTCAGCCGT CGCATAAGAA TTTGTATCAG CCCGCAGGCA ACCAGTGTTA TCTGTTCGAT AACCTCGAAA AACTGGGCTT CTCTCAGCAC CTGATGCTGG ATCATAACGG TATTTTTGGC GGCTTCCTGA AAGAAGTGCG CGAAAACGGC GGCATGCATG CGCCGCTGAT GGATCAAACG GGTCTGCCGG TTCCGCTGCT GGGCTTTGAC GGTTCGCCAG TGTATGACGA CACCGCCGTG CTACAACGCT GGCTGCAAAC GGTCGGCAAG GACGATAACG CGCGTAGCGC CACGTTCTAC AACACGTTGC CGCTGCATGA CGGGAACCAC TATCCGGGCG TGAGCAAAAC GGCGGATTAC AAAGTGCGTG CGCAGAAATT CTTCGATGAG CTGGATGCGT TCTTTACTGA GCTGGAAAAA TCTGGCCGTA AGGTGATGGT TGTGGTCGTG CCAGAACATG GCGGCGCGCT GAAAGGCGAC AGAATGCAGG TGTCCGGGCT GCGCGATATT CCAAGCCCGT CGATCACCAA CGTGCCTGCC GGGATTAAAT TCTTTGGCAT GAAAACCCCG CATCAGGGTG CGCCAATTGA GATTACCCAG CCAAGCAGCT ATCTGGCGAT TTCAGAACTG GTGGCGCGTG CGGTGGACGG GAAATTGTTT GTGGAGGACA GCGTGAACTG GAATCAGCTC ACCAGCGGCC TGCCGCAGAC GGCGGAAGTC TCCGAGAACG CCAACGCGGT AGTGATTCAG TATCAGGATA AACCGTACGT GCGGTTGAAT GGCGGGGATT GGGTGCCGTA TCCGCAGTAA
|
Protein sequence | MTNSTYTASA PSPLWQYWRG LSGWNFYFLV KFGLLWAGYL NFHPLLNLVF MAFLLMPIPN LKLHRLRHWI AIPIGFVLFW HDTWLPGPES IMSQGSQVAG FSAGYIQDLV ERFINWQMIG AIFVLFIAWL FLSQWLRITV FVVAILIWLN VLTLAGPSFS LWPAGQATDT VTTTGGTAAP TVATAGATPV IGDIPSQTAP PTNTNLNAWL SSFYNAEAKR QTKFPDALPA DAQPFELLVI NICSLSWADV EAAGLMSHPL WSHFDIQFKD FNSATSYSGP AAIRLLRASC GQPSHKNLYQ PAGNQCYLFD NLEKLGFSQH LMLDHNGIFG GFLKEVRENG GMHAPLMDQT GLPVPLLGFD GSPVYDDTAV LQRWLQTVGK DDNARSATFY NTLPLHDGNH YPGVSKTADY KVRAQKFFDE LDAFFTELEK SGRKVMVVVV PEHGGALKGD RMQVSGLRDI PSPSITNVPA GIKFFGMKTP HQGAPIEITQ PSSYLAISEL VARAVDGKLF VEDSVNWNQL TSGLPQTAEV SENANAVVIQ YQDKPYVRLN GGDWVPYPQ
|
| |