Gene Information |
Locus tag | SeD_A2442 |
Symbol | |
ID | 6874078 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 2315614 |
End bp | 2317146 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642785531 |
Product | colanic acid exporter |
Protein accession | YP_002216189 |
Protein GI | 198244397 |
COG category | [R] General function prediction only |
COG ID | [COG2244] Membrane protein involved in the export of O-antigen and teichoic acid |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.000846764 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCTCAATC TGGCTGCGCC AATGACGCCT TGTGAACAGG TTCTGGAGAA AAGAATGAGT TTACGACAAA AAACGATCAG CGGCGCTAAA TGGTCGGCTA TCGCCACGAT AGTGATTATC GGTCTGGGGT TAATTCAGAT GACGGTGCTG GCGCGGATCA TCGACAACCA CCAGTTTGGC CTGTTGACCG TCTCGCTGGT GATTATCGCG CTGGCCGACA CGATCTCGGA CTTTGGCATC GCGAACTCGA TTATCCAGCG TAAAACGATT GGGCATCTGG AGCTGACCAC GCTCTACTGG CTAAACGTTG GGCTTGGGAT TGTGGTATTT GCGGTGGTCT TTTGGCTGAG CGATGCGATT GCCCATGTTT TGCATAACCC GGATCTCGCG CCGTTAATCA AAACGTTGTC GCTGGCGTTC ATCGTGATCC CTCACGGGCA GCAGTTCCGC GCTCTGATGC AAAAAGAGCT GGAGTTCAAT AAGATCGGCA TGATCGAAAC GACCTCCGTG CTGGCGGGGT TCACCTTTAC GGTGATTAGC GCCCATTACT GGCCGCTGGC GTTAACCGCG ATTCTCGGCT ATCTGGTGAA CAGTGCGGTG CGAACGCTGC TGTTTGGTTA CTTTGGCCGC AAGATTTACC GTCCGGGGCT GCATTTTTCG CTGGCATCGG TGTCCACGAA CCTGCGTTTT GGCGCGTGGC TGACGGCGGA CAGTATCGTC AATTACATCA ACACGAATTT ATCGACGCTG GTACTGGCGA GAATTCTCGG CGCAAGCGTC GCCGGGGGAT ATAACCTCGC GTACAACGTG GCGGTTGTGC CGCCGGCGAA GCTTAACCCC ATCATTACTC GCGTGCTATT TCCGGCATTC GCCAAAATCC AGGACGATAC CGAGAAGCTG CGCGTCAACT TCTATAAGTT GCTTTCCGTG GTGGGAATTA TCAATTTTCC CGCGCTGTTG GGCCTGATGG TAGTGGCGAA CAATTTTGTG CCGTTAGTGT TTGGCGAGAA GTGGAACAGT ATTATCCCGA TCCTGCAATT GCTGTGCGTG GTGGGGCTGT TGCGCTCGGT CGGCAACCCG ATTGGTTCGC TGCTGATGGC GAAAGCGCGC GTGGATATCA GCTTTAAGTT CAACGTCTTT AAAACGTTTC TGTTTATCCC GGCAATTCTC ATTGGCGGCC ATCTGGCGGG GGCGATTGGC GTGACGCTGG GGTTCCTGGT GGTGCAAATC ATCAACACCA TTCTGAGCTA TTTCGTGATG ATTAAGCCGG TACTCGGCTC CAGCTATCGT CAGTATATTC TCAGCCTGTG GCTACCGTTT TATCTCTCAT TGCCAACATT TATTACCAGC TACGGCGCAG GAAAGCTGGC CGACGGCTAT TTACCGTTGT CAGGGGTATT CGCACTACAG GTTATGGTCG GTATTTTGAG TTTTATTTTG ATGATTATTT TTTCACGCAA TGCGCTGGTG ATGGAAATTA AAAATCAGCT TGTTGGCAGT GCAAAAATGA AAAAGTTATT ACGTGTCGGA TGA
|
Protein sequence | MLNLAAPMTP CEQVLEKRMS LRQKTISGAK WSAIATIVII GLGLIQMTVL ARIIDNHQFG LLTVSLVIIA LADTISDFGI ANSIIQRKTI GHLELTTLYW LNVGLGIVVF AVVFWLSDAI AHVLHNPDLA PLIKTLSLAF IVIPHGQQFR ALMQKELEFN KIGMIETTSV LAGFTFTVIS AHYWPLALTA ILGYLVNSAV RTLLFGYFGR KIYRPGLHFS LASVSTNLRF GAWLTADSIV NYINTNLSTL VLARILGASV AGGYNLAYNV AVVPPAKLNP IITRVLFPAF AKIQDDTEKL RVNFYKLLSV VGIINFPALL GLMVVANNFV PLVFGEKWNS IIPILQLLCV VGLLRSVGNP IGSLLMAKAR VDISFKFNVF KTFLFIPAIL IGGHLAGAIG VTLGFLVVQI INTILSYFVM IKPVLGSSYR QYILSLWLPF YLSLPTFITS YGAGKLADGY LPLSGVFALQ VMVGILSFIL MIIFSRNALV MEIKNQLVGS AKMKKLLRVG
|
| |
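The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the terminal stop codon (the gene sequence above ends in TGA). The function names `gene_length`, `protein_length`, and `strip_blocks` are hypothetical helpers introduced for this example.

```python
# Values copied from the record above.
START_BP = 2315614
END_BP = 2317146
REPORTED_GENE_LENGTH_BP = 1533
REPORTED_PROTEIN_LENGTH_AA = 510


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length_bp // 3 - 1


def strip_blocks(sequence: str) -> str:
    """Remove the spaces that split the displayed sequence into 10-letter blocks."""
    return "".join(sequence.split())


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    length_aa = protein_length(length_bp)
    print("gene length (bp):", length_bp,
          "matches record:", length_bp == REPORTED_GENE_LENGTH_BP)
    print("protein length (aa):", length_aa,
          "matches record:", length_aa == REPORTED_PROTEIN_LENGTH_AA)
```

Under these assumptions, the computed values agree with the Gene Length and Protein Length fields. `strip_blocks` can be applied to the Gene sequence or Protein sequence field before passing it to other tools, since both are displayed in space-separated blocks of ten characters.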