Gene Information |
Locus tag | SeD_A4607 |
Symbol | |
ID | 6872655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 4447472 |
End bp | 4448335 |
Gene Length | 864 bp |
Protein Length | 287 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 642787510 |
Product | hypothetical protein |
Protein accession | YP_002218108 |
Protein GI | 198246134 |
COG category | [R] General function prediction only |
COG ID | [COG0561] Predicted hydrolases of the HAD superfamily |
TIGRFAM ID | [TIGR00099] Cof subfamily of IIB subfamily of haloacid dehalogenase superfamily; [TIGR01484] HAD-superfamily hydrolase, subfamily IIB |
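The coordinate and length fields in this table can be cross-checked with a short calculation. The Python sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that Gene Length includes the stop codon; the function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(4447472, 4448335)
print(length, protein_length(length))  # 864 287
```

Under these assumptions the result agrees with the Gene Length (864 bp) and Protein Length (287 aa) fields.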
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.356417 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 76 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGTTAA TCACCGATCT GGATGGAACA TTACTCACCT CGCAGAAAAC AATCAGCCCG CGCACGCGTC AGGCGCTGAT TGCGTTCCGC CAGGACGGTG GTCTACTGGC GGCATGTTCC GCCAGACCGG TCTCCTCAAT GGTACGCCTG CTACGCCAAC AACAGGTTGA TAGGTTGTTT AGCTGGTGCG CCGGTTTTAA CTGCGGACAC CTTCTGGAGA TGGCGGGACA GCGCATTATT CATGCTGCCC CTCTGACCGC CACAGACCTG TGGAATATTG ACCAGCATAT TTCTCTTTCC CGCTATCACC ACCATTTTTT TAGTGCCGAA GCAATTCACC ATCGTGACGA TAGACTGATT GCGCACTGGA CAACATATGA GGCTCGCTTA TTTGGATTAC CGCTTATAAC TGAAACTGCA GAAAATATCT TTAATCGTCG CAACATATAT AAAATTACAC TTGTTGCCGC ATCTCCGGAG ATAGATAATC TGTGTACAGA AGTGAATAAT CACCTGCCTT GTGGATATTA TGCGGTTGTC ACGGGAGAGA ATTATATTGA TATTCAAAGA TCCGATATAA ATAAAGGGTG CATAATAGAA CAATTAATTC ATTATTTAAA TATATCTTCT GACAAGGTGG TCGCGATTGG CGATCAGCAG AATGATGTCA GCATGTTTGC CGCCGCCGGA ATCAGCATCG CAATGGGCAA CGCGCCGGAT GCAGTAAAGC GGCAGGCCGG CTATGTGACT GCCACGAATG ATGAGGAGGG TATCGTCCAT GCGTTGGAGT GGTTGCGTTG CCTTACGCAT CCAGTTACCA TGCGCCAAAG GTTGACGGCG GCGAAAGATA ATGAATCCAA TTAA
|
Protein sequence | MLLITDLDGT LLTSQKTISP RTRQALIAFR QDGGLLAACS ARPVSSMVRL LRQQQVDRLF SWCAGFNCGH LLEMAGQRII HAAPLTATDL WNIDQHISLS RYHHHFFSAE AIHHRDDRLI AHWTTYEARL FGLPLITETA ENIFNRRNIY KITLVAASPE IDNLCTEVNN HLPCGYYAVV TGENYIDIQR SDINKGCIIE QLIHYLNISS DKVVAIGDQQ NDVSMFAAAG ISIAMGNAPD AVKRQAGYVT ATNDEEGIVH ALEWLRCLTH PVTMRQRLTA AKDNESN
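The relationship between the two sequences can be illustrated with a minimal translation sketch in Python. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAA, even though the gene lies on the minus strand) and uses the sense-codon assignments of translation table 11, which match the standard code. The names `translate` and `gc_percent` are hypothetical helpers, and only the first and last blocks of the gene sequence are used as input.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout of genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces used to group the sequence into blocks of 10."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon in frame 0, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


first_blocks = "ATGTTGTTAA TCACCGATCT GGATGGAACA"  # first 30 bases of the gene
last_blocks = "GCGAAAGATA ATGAATCCAA TTAA"         # last 24 bases of the gene

print(translate(first_blocks))  # MLLITDLDGT
print(translate(last_blocks))   # AKDNESN
```

The two outputs match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.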