Gene SeD_A4145 details


Gene Information

Locus tag: SeD_A4145
Symbol:
ID: 6871897
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3990607
End bp: 3991644
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 51%
IMG OID: 642787085
Product: virulence protein
Protein accession: YP_002217711
Protein GI: 198243175
COG category: [R] General function prediction only
COG ID: [COG3943] Virulence protein
TIGRFAM ID:
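
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive (an assumption that is consistent with the listed length) and that the protein length excludes the stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3990607, 3991644)
print(length, protein_length_aa(length))  # prints: 1038 345
```

The result does not depend on the strand, which the record leaves blank.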


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.373189
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGACA AATACTTAAC CCAATCCCCG GCAGGCGAAT TTGTTATGTT TGCCAGCGAT 
GACGGTGAAG TTCGTGTGGA GTGCCGCTTT GAGCAAGAGA CGCTATGGCT CCCTCAGGCA
ACCATCGCCA ACCTTTATCA GATCACTCCC CAGGCAGTTA CACAGCACAT TAAAGCGATC
TATGAAGAAG GCGAACTTGA GCAAAACGCA ACCTGTAAGT CTTACTTACA AGTTCAACAG
GAAGGTAGCC GTCAGGTAAG CCGCAACAGG CTTCACTACA GCCTGCCTGT CATCCTTGCT
GTCGGCTACC GCGTTCGTTC CCCGCGCGGC ACACAGTTCC GCCAGTGGGC AACCCAGACG
CTCCAGAAAT ACCTGATCAA AGGTTTTGTG ATGGACGATG AGCGCCTGAA AAATCCGCCC
GTGGGTTCAT CGGCTGTACC CGACTATTTT GATGAGATGC TGGAGCGTAT CCGCGATATT
CGCGCCAGCG AACGTCGGGT TTATTTGCGG GTACGAGAGA TCTTTGCGTT AGCCGCCGAC
TATCAACCAT CGCTCAAAGA AACCACGCAA TTTTTTCAAA CCATCCAGAA CAAGTTGCAT
TTTGCCTGTA CCGGACATAC CGCTGCTGAA CTCATTCATC AGCGTGCTGA CGCCAGCCAG
CCGCATATGG GGCTGACCAG CTATAAAGGT GAAGAGGTAC GTAAGGGTGA CGTGACGGTG
GCAAAAAATT ATCTCACTCA GGATGAAGTC AGCGAGCTTA ACCGCGTAGT TAACATGTGG
CTGGATTTTG CCGAGGATCA GGCCCGTCGT CGTCAGCAGA TCTTTTTACG CGACTGGCAG
GATAAGCTGG ATCAGTTCCT GCAATTTAAC GACCGTGAGG TTTTACAAGG CGCAGGTAAA
GTCACTAAGA AAATGGCCGA TGAAAAAGCG CAGGCGGAAT ATAGTCAGTT TGCTGAACAA
CAACGGCGCT TAAAAGAAGC CGAAGGTGAG AAGGATATCG CCGGTTTGCT ACAATGGAAA
ACAGAACCTA AAAAGTAG
 
Protein sequence
MADKYLTQSP AGEFVMFASD DGEVRVECRF EQETLWLPQA TIANLYQITP QAVTQHIKAI 
YEEGELEQNA TCKSYLQVQQ EGSRQVSRNR LHYSLPVILA VGYRVRSPRG TQFRQWATQT
LQKYLIKGFV MDDERLKNPP VGSSAVPDYF DEMLERIRDI RASERRVYLR VREIFALAAD
YQPSLKETTQ FFQTIQNKLH FACTGHTAAE LIHQRADASQ PHMGLTSYKG EEVRKGDVTV
AKNYLTQDEV SELNRVVNMW LDFAEDQARR RQQIFLRDWQ DKLDQFLQFN DREVLQGAGK
VTKKMADEKA QAEYSQFAEQ QRRLKEAEGE KDIAGLLQWK TEPKK
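
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by computing its GC fraction. The sketch below is one way to do that in plain Python, under stated assumptions: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, whitespace in the 10-base blocks is ignored, and the codon-to-amino-acid assignments are those of the standard code, which translation table 11 shares. Table 11 also allows alternative start codons, which this sketch does not handle. That is not needed here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGCAGACA AATACTTAAC CCAATCCCCG"))  # prints: MADKYLTQSP
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (spaces removed) extends the same check to the whole record.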