Gene SeD_A0821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0821 
Symbol 
ID6874056 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp817078 
End bp818010 
Gene Length933 bp 
Protein Length310 aa 
Translation table11 
GC content59% 
IMG OID642784016 
Productallophanate hydrolase, subunit 2 
Protein accessionYP_002214695 
Protein GI198243417 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1984] Allophanate hydrolase subunit 2 
TIGRFAM ID[TIGR00724] biotin-dependent carboxylase uncharacterized domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0263857 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones78 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAATA TTATTCGCGC GGGGATTTAT ACCTCCGTGC AGGATAGCGG GCGTCACGGC 
TTTCGTCAGT CGGGGCTGAG TCACTGCGGC GCGCTGGATA AACCCGCCTT TCAGACCGCT
AATCTCCTGG TCGGGAACGA TGCGAATGCC CCGGCGCTGG AAATCACTCT CGGCCAACTG
GTTGTCGAAT TTGAAAATGA GACCTGGTTC GCTCTTACCG GCGCAGGCTG CGAGGCGCAG
TTGGATGATC AACCGGTCTG GACCGGCTGG CGATTGCCGG TAAAAGCGGG TCAGCGTCTT
ACGCTGCATC GACCGCTTCA CGGGATGCGT AGCTATCTGG CGGTGGCGGG CGGTATTGCT
GTGCCGGAGG TGATGGGATC GTGTAGTACC GATCTGAAGT CCGGCATCGG TGGGCTGGAA
GGACGGCTGC TAAAAGATGG CGATCGGCTG GCGACGGGTA AACCATCGCG ACAGTTTAGC
GGGCCGCAGG GCGTGAAGCA GTTACTGTGG GGGAATCGCA TCCGTGCGCT ACCGGGGCCG
GAATACCGTG AGTTCGATCG CGCCTCGCAA GAAGCGTTCT GGCGTTCGCC ATGGCAGCTC
AGCCCGCAAA GTAATCGCAT GGGCTATCGT TTGCAGGGAC AATCGTTAAC GCGGACAACG
GATCGCGAAC TGCTGTCGCA CGGTTTGCTG CCCGGTGTCG TGCAGGTGCC TTACAACGGC
CAACCTATTG TGCTAATGAA TGATGCCCAG ACAACTGGCG GCTATCCGCG CATTGCCTGC
ATCATCGAGG CAGATATGTA CCATCTGGCG CAGATCCCGC TAGGGCAACC GATCCATTTT
ATGCAATGTT CGCTGGAAGA GGCGCTCAAC GCGCGCCGCG AGCGTCAGCG CTATCTGGAA
CAGCTTACCT GGCGGCTTCA GCATGAACAT TGA
 
Protein sequence
MLNIIRAGIY TSVQDSGRHG FRQSGLSHCG ALDKPAFQTA NLLVGNDANA PALEITLGQL 
VVEFENETWF ALTGAGCEAQ LDDQPVWTGW RLPVKAGQRL TLHRPLHGMR SYLAVAGGIA
VPEVMGSCST DLKSGIGGLE GRLLKDGDRL ATGKPSRQFS GPQGVKQLLW GNRIRALPGP
EYREFDRASQ EAFWRSPWQL SPQSNRMGYR LQGQSLTRTT DRELLSHGLL PGVVQVPYNG
QPIVLMNDAQ TTGGYPRIAC IIEADMYHLA QIPLGQPIHF MQCSLEEALN ARRERQRYLE
QLTWRLQHEH