Gene Information |
Locus tag | Ent638_2657 |
Symbol | wcaM |
ID | 5114624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | - |
Start bp | 2856735 |
End bp | 2858126 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640492845 |
Product | putative colanic acid biosynthesis protein |
Protein accession | YP_001177374 |
Protein GI | 146312300 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
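The coordinate and length fields above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the original record: the function names `gene_length` and `protein_length` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Ent638_2657 record above.
length = gene_length(2856735, 2858126)
print(length, protein_length(length))
```

Under those assumptions the sketch yields 1392 bp and 463 aa, which agrees with the Gene Length and Protein Length fields.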
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.863181 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAAAA AACTCACCCG ACGCACTTTC GTTACGTCCT GCACACTGCT GGCCGCCACG CCGTTACTTA ACTCTCGCGT GGCGCGCGCT GCTGGCAGCG GGGCGGTCAA TATCAGCCAG TATAACCCCA AAGACTGGAT TGCCTCTTTT AAGCAGGCTT TCACGGAGAG CGATACCGTT ATCGTTCCTG CCGGACTGAC CTGCGAAAAT ATTAACACCG GCATTTTTAT TCCCGATGGT AAAACGCTGC TGATTCGCGG CGCGCTGACC GGCAATGGCC GCGGGCGTTT CGTCCTCCAG GAAGGCAGCA AAGTGGTCGG CGAAGGGAAA GGGCGTACCG AAAATATTAC CCTGGACGTG CGCGGATCTG ATTGCGTCAT TCAGGGGCTG GCGATGAGCG GCTTTGGGCC GGTGACGCAA ATCTACATTG GCGGTAAAAA GCCCAGAGTG ATGCGCAATC TGCTGATCGA CAATATTACC GTTACGCAGG CGAACTATGC GATTTTGCGA CAGGGTTTTC ACAATCAGGT GGACGGCGCG CGCATAACGA ACAGTCGCTT CAGCCATCTG CAGGGTGATG CCATTGAGTG GAACGTGGCC ATCAACGACC GCAACATTCT AATTTCTGAC CATGTTATCG ACAACATTAA CTGTACAAAT GGCAAGATTA ACTGGGGTAT CGGAATCGGG CTGGCGGGAA GCACCTACGA CAATGACTAC CCTGAGAAGC AAACCGTCAA AAACTTCGTC GTCGCGAATA TTACGGGAAG CAACTGTCGC CAACTCGTTC ACGTCGAAAA CGGCAAACAT TTTATTATCC GCAATGTAAA AGCGAAAAAT ATCACCCCTG ACTTCAGTAA AAAGGCGGGT ATCGATAACG CCACGGTGGC CATTTACGGG TGTGATAATT TCATTATTGA TGATATCGAT ATGGTTAATA GTGCAGGGAT GTTAATTGGT TACGGAGTGA TTAAAGGGGA CTATTTGTCG ATACCCCAGA ACTTTAAACT CAATGACATT CGGCTCGATA ACAGTCAGCT TGATTATAAA CTGCGCGGCA TTCAGATATC TTCCGGCAAT GCGACGTCGT TTGTTGCCAT CACCAACCTT GAAATGAAAC GCGCAACGCT TGAACTGCAT AACAAGCCCC AGCATCTCTT TTTGAGAAAT ATCAATGTGA TGCAAGAAGC CGCTATCGGC CCTGCGCTGA AGATGAACTT TGATCTGCGC AAAGATGTGC GTGGCAAATT TATGGCCAAG GACGAAACCC TGCTGTCGCT GGCAAACATC AAAGCCGTGA ATGAGAAGGG GCAGAGTTCA GTGGATATCG ATAGGGTGGA TCAGAAGGTG GTGAATGTGG AGCGTCTGAA TTTTAAGCTT CCGAGCAGAT AA
|
Protein sequence | MLKKLTRRTF VTSCTLLAAT PLLNSRVARA AGSGAVNISQ YNPKDWIASF KQAFTESDTV IVPAGLTCEN INTGIFIPDG KTLLIRGALT GNGRGRFVLQ EGSKVVGEGK GRTENITLDV RGSDCVIQGL AMSGFGPVTQ IYIGGKKPRV MRNLLIDNIT VTQANYAILR QGFHNQVDGA RITNSRFSHL QGDAIEWNVA INDRNILISD HVIDNINCTN GKINWGIGIG LAGSTYDNDY PEKQTVKNFV VANITGSNCR QLVHVENGKH FIIRNVKAKN ITPDFSKKAG IDNATVAIYG CDNFIIDDID MVNSAGMLIG YGVIKGDYLS IPQNFKLNDI RLDNSQLDYK LRGIQISSGN ATSFVAITNL EMKRATLELH NKPQHLFLRN INVMQEAAIG PALKMNFDLR KDVRGKFMAK DETLLSLANI KAVNEKGQSS VDIDRVDQKV VNVERLNFKL PSR
|
| |
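The gene and protein sequences above can be compared by translating the nucleotide sequence. The following is a minimal sketch under stated assumptions: the names `clean`, `translate`, and `gc_percent` are hypothetical helpers, the input is assumed to contain only A, C, G, and T, and the sequence is assumed to be given in coding orientation (as listed, it begins with ATG and ends with TAA even though the gene lies on the minus strand). Translation table 11 assigns the same amino acids to codons as the standard table and differs in the permitted start codons, so the standard codon-to-residue mapping is used here.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence listed above.
print(translate("ATGCTGAAAA AACTCACCCG ACGCACTTTC"))
```

Passing the full Gene sequence field to `translate` and comparing the result with the Protein sequence field (after removing its spaces) is one way to confirm that the two entries agree; `gc_percent` can be applied to the same string to compare against the GC content field.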