Gene SeD_A2040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2040 
SymbolastB 
ID6871676 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1970367 
End bp1971710 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content56% 
IMG OID642785154 
Productsuccinylarginine dihydrolase 
Protein accessionYP_002215820 
Protein GI198242760 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3724] Succinylarginine dihydrolase 
TIGRFAM ID[TIGR03241] succinylarginine dihydrolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCGC ATGAAGTTAA TTTTGATGGG CTGGTGGGGC TTACGCACCA TTATGCCGGG 
CTATCCTTCG GCAATGAGGC ATCGACCCGC CACCGTTTTC AGATGTCGAA TCCTCGTCTG
GCGGTAAAGC AGGGGCTGCT AAAGATGAAG GCTTTGGCGG ATGCCGGCTT TCCCCAGGCG
GTGATCCCGC CGCATGAGCG GCCTTTTATT CCGGCGTTGC GTCAGCTCGG CTTCACGGGT
AGCGATGAGC AGATTCTGGA TAAGGTTGCG CGTCAGGCGC CACGCTGGCT TTCTAGCGTG
AGTTCCGCGT CGCCAATGTG GGTTGCGAAT GCGGCGACGG TTTGCCCATC GGCAGACGCG
CTGGACGGGA AAGTTCACCT GACGGTGGCG AATTTAAACA ATAAATTTCA TCGCGCTCTT
GAGGCGCCTG TTACCGAAGC GCTGCTACGC GCCATATTTC GCGATGAAAG TCAGTTTTCA
GTGCATAGCG CGTTACCGCA GGTCGCATTA TTGGGAGATG AAGGCGCGGC GAATCATAAC
CGTCTGGGCG GCGAGTATGG TTCGGCAGGC GTGCAGCTTT TTGTCTATGG GCGCGAAGAG
GAGAATGAAA TACGACCCGC TCGTTATCCG GCGCGCCAGA GCCGCGAAGC CAGCGAGGCC
GTGGCGCGTC TTAATCAGGT GAATCCGCAA CAGGTTATCT TCGCTCAGCA GAACCCGGAG
GTCATCGATC AAGGCGTATT CCATAATGAT GTCATCGCCG TTTCGAATCG ACAGGTATTG
TTTTGTCACG AAGCGGCGTT TGCCCGGCAG AAAGTGCTCA TTAATCAGTT GCGTACGCGC
GTTGACGGTT TTATGGCGAT AGAGGTGCCC GCCGGAGAGG TTTCTGTATC AGATGCTGTG
GCGACCTACC TGTTTAATAG TCAGTTGTTA AGCCGTGACG ACGGCTCAAT GCTGCTAGTG
TTGCCGCGGG AATGTCAGGA TCATGTCGGC GTCTGGCGCT ATCTGAATAA GCTGGTGGCG
GAGGATAACC CCATCAGCGC GATGCAGGTG TTTGATTTGC GAGAAAGTAT GGCTAACGGT
GGCGGGCCGG CCTGTCTGCG ATTACGCGTG GTGTTAACAG AAGAAGAACG ACGGGCGGTG
AATCCAGCGG TAATGATGAA TGACGCTCTG TTTACGGCCC TTAACGCGTG GGCGGATCGT
TATTATCGCG ATCGCCTGAC CGCTGCCGAT CTGGCCGATC CGTTATTATT GCGAGAAGGC
CGGGAGGCGC TGGATGTGTT AACGCGTCTG CTGGATTTGG GGTCGGTTTA TCCTTTCCAG
CAAACGGGGG CGGCTGATGG ATAA
 
Protein sequence
MTAHEVNFDG LVGLTHHYAG LSFGNEASTR HRFQMSNPRL AVKQGLLKMK ALADAGFPQA 
VIPPHERPFI PALRQLGFTG SDEQILDKVA RQAPRWLSSV SSASPMWVAN AATVCPSADA
LDGKVHLTVA NLNNKFHRAL EAPVTEALLR AIFRDESQFS VHSALPQVAL LGDEGAANHN
RLGGEYGSAG VQLFVYGREE ENEIRPARYP ARQSREASEA VARLNQVNPQ QVIFAQQNPE
VIDQGVFHND VIAVSNRQVL FCHEAAFARQ KVLINQLRTR VDGFMAIEVP AGEVSVSDAV
ATYLFNSQLL SRDDGSMLLV LPRECQDHVG VWRYLNKLVA EDNPISAMQV FDLRESMANG
GGPACLRLRV VLTEEERRAV NPAVMMNDAL FTALNAWADR YYRDRLTAAD LADPLLLREG
REALDVLTRL LDLGSVYPFQ QTGAADG