Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A2036 |
Symbol | |
ID | 6874937 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 1967629 |
End bp | 1968510 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642785150 |
Product | nucleotide excision repair endonuclease |
Protein accession | YP_002215816 |
Protein GI | 198245824 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0322] Nuclease subunit of the excinuclease complex |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.362279 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTACGGC GTCAATCAGC CCCCCGCCTT GAGTTTGAAG CGGCGGCCAT TTACGAATAT CCAGAACATT TACGTCCTTT CCTCAGCGAA TTGCCAGCCT TACCGGGCGT CTATGTATTT CATAGCGAAA GCGATACGCT GCCGCTTTAT ATCGGTAAAA GCGTGAATAT TCGCAGCCGG GTGCTTTCAC ATCTGCGAAC GCCTGACGAG GCCACTATGC TGCGACAGGC GCGGCGGATA AGCTGGATCT GCACCGCTGG GGAAATGGGC GCGTTGCTGC TGGAGGCACG GCTGATCAAA GAACAACAGC CGCTGTTTAA CAAGCGGTTA CGTCGTAATC GCCAGCTTTG CTCGCTACAG TTGAGCGAAC AAAAGATTGA GGTCGTTTCC GCCCGCAGCG TCGATTTTTC CCATGAGCCA AACCTGTTTG GCCTCTTCGC CAACCGTCGG GCGGCGCTGC AAAGCTTACA GAACCTCGCC GATGAACAAA AATTGTGTTA TGGCCTGCTG GGGCTTGAAC CGGTAAGCCG TGGGCGCTCC TGTTTTCGTT TCGCTCTGAA GCGCTGCGCT GGCGCGTGCT GTGGGCAAGA AACCCCGCAG GCGCATTTTC TTCGTCTGCA GGCCTCGCTG GAACGGTTAC GTGTGGTTTG CTGGCCCTGG AAAGGCGCTA TCGCGTTAAA AGAAAGCCGC CCACAAATGA CCCAGTTCCA CATTATCAAC AACTGGTTAT GGCTGGGGGC GGTCCCCTCG CTGGATGAGG CCGCCACGCT GGTGCGCACC CCTGCGGGCT TTGATCAGGA TGGCTATAAA ATTCTCTGTA AGCCACTAAT GTCAGGTCAA TATGAGATCA TTGAACTGCA CACCGACTGT CGCCAGTCAT AA
|
Protein sequence | MVRRQSAPRL EFEAAAIYEY PEHLRPFLSE LPALPGVYVF HSESDTLPLY IGKSVNIRSR VLSHLRTPDE ATMLRQARRI SWICTAGEMG ALLLEARLIK EQQPLFNKRL RRNRQLCSLQ LSEQKIEVVS ARSVDFSHEP NLFGLFANRR AALQSLQNLA DEQKLCYGLL GLEPVSRGRS CFRFALKRCA GACCGQETPQ AHFLRLQASL ERLRVVCWPW KGAIALKESR PQMTQFHIIN NWLWLGAVPS LDEAATLVRT PAGFDQDGYK ILCKPLMSGQ YEIIELHTDC RQS
|
| |