Gene SeD_A1871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1871 
Symbol 
ID6872649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1808603 
End bp1810303 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content57% 
IMG OID642784997 
Productputative amidohydrolase family protein 
Protein accessionYP_002215665 
Protein GI198242385 
COG category[R] General function prediction only 
COG ID[COG1574] Predicted metal-dependent hydrolase with the TIM-barrel fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0576549 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAGTC TCGTCCCGCC TCTGCTTTCC AGAACCGCGC TTCTCTTTCT GCTCACGGCC 
ACAGGCGCAG CCACCGCCGC CCGCCCGGCA GCGGATATTA TTTTGCATAA CGGTAATATC
ATCACTCTGA ATGATGCCCA GCCGCAGGCC AGCGCGCTGG TGATTTCCGG CTCGCGGATT
GTGGCGATTG GCGATGATAC GGCGACAGAT GAATGGCGCG GCGACCATAC CCGTACCATC
GATTTACAGG GTAAAACCGT GATACCCGGC CTGACCGATA CCCACATCCA CGCCATTCGC
GGCGGACAAA CCTGGACATT CGAAACCTAC TGGTACGACA GCCCTTCGCT TAAAGACGCG
CTGGATAAAT TACGCGCCGA CGCTAACCGT CGTCCCCACG ATCAATGGGT AGCCGTAGTG
GGATCGTGGA TACCGGCGCA ATTTGCAGAA AACCGGGCGC CGACGGTAGC CGAATTGAGC
CACGCCCTTC CCGATCATCC GGCTTATATT CAGTATCTTT ACGACTATGC TTTAGTGAAT
CAGCGCGGTA TAGACGTACT TGGCCTTAAC GACACCCCTC CTCCTGATTT AGCGGGAATC
CGCGTAGAGC GCGACGCAAA AGGTAGCGCC ACGGGGAAAT TATTTGGTGA CATCGCCGCG
TTTAACCAGC TTTTTGCCAG CATAAGTAGT AACGCCGATC GCGAGGGCGG TCTGCGACAA
TTTTTCGCTG ATATGAACGC TCGCGGCGTG ACCGGCATCA TTGACCCCTC TGCCGGGCCT
GCCGCCGCTT ATGAGCCTTT ATTTGCAATG CGAAACCAGG GGGATTTACC GCTGCGCGTG
GGGTATCGCA TTCCGGTACA GCCAGAAGCG AAAGGTCATG AAGCGCAGTG GTTCAGCAAC
CTGATGGCCT TTCGCCCGGC GCGTGCCGAT GACGGGCAAC TGGCTTTTCT TGGCCTGGGG
GAAAGCCTGG TGGCCGGAAT GAATGACGGC GTGCGGATGG CCCCAGGGTT TTCTTCCTCA
GAGCAGGACA AAACCGCGCT TCGCCAGGTC GCGACATTTG CGGCAAAACG GGGAATACCG
CTTGAGATCC ACGCCTATAC CGATGACAGC GCCGACGCGA TATTGACGAT TTTTGAGCAG
GTAGCGCAGC AGTACGATCT GCGCCCTCTC CGCTGGTCTA TTGCGCATCT GAATACCGGT
TCGCCACAGA CGCTTGAGCG AATGCGTAAG CTGGGTCTGG CATACACTGT GCAAATGGGG
CCTTACTTTG AGGGGCTTGC CATCCGTGAC GCCAATCCCC CCGGCGCGAC GGACAATTCG
CCGCCGGTTC GACTGGCGCT GGATAAAGGG CTTGTCGTAG CTGGCGGTAC CGATTCGACG
CGTATTGGCA TTGCCGGTGT CTGGCACGCT ATCGAATATC ATATCATCGG TATAGCGTCA
GGCGGTTCCG TGCGTAAACC CGCCAGCGAG CGGCTCACGC GTCTGGAAGC GCTAGCGTTA
TATACACGTC ATGCCGCCTG GCTCGCCTTT GCCGAACAAC ACCGGGGCCA GCTTCGCGTC
GGAAAACAGG CCGATCTGGC GGTACTCAAT CAGCCATTTA TGACGATGCC GGAAGACAGA
ATTGATACCA TTCGCGCTGT TTTGACGCTT GTCGATGGAC GCATTGTTCA CGAAAGTCCG
GACCTTAACG CCGGACAATG A
 
Protein sequence
MISLVPPLLS RTALLFLLTA TGAATAARPA ADIILHNGNI ITLNDAQPQA SALVISGSRI 
VAIGDDTATD EWRGDHTRTI DLQGKTVIPG LTDTHIHAIR GGQTWTFETY WYDSPSLKDA
LDKLRADANR RPHDQWVAVV GSWIPAQFAE NRAPTVAELS HALPDHPAYI QYLYDYALVN
QRGIDVLGLN DTPPPDLAGI RVERDAKGSA TGKLFGDIAA FNQLFASISS NADREGGLRQ
FFADMNARGV TGIIDPSAGP AAAYEPLFAM RNQGDLPLRV GYRIPVQPEA KGHEAQWFSN
LMAFRPARAD DGQLAFLGLG ESLVAGMNDG VRMAPGFSSS EQDKTALRQV ATFAAKRGIP
LEIHAYTDDS ADAILTIFEQ VAQQYDLRPL RWSIAHLNTG SPQTLERMRK LGLAYTVQMG
PYFEGLAIRD ANPPGATDNS PPVRLALDKG LVVAGGTDST RIGIAGVWHA IEYHIIGIAS
GGSVRKPASE RLTRLEALAL YTRHAAWLAF AEQHRGQLRV GKQADLAVLN QPFMTMPEDR
IDTIRAVLTL VDGRIVHESP DLNAGQ