Gene SeD_A1100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1100 
Symbol 
ID6873005 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1096755 
End bp1098026 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content52% 
IMG OID642784285 
Producthypothetical protein 
Protein accessionYP_002214959 
Protein GI198242430 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value0.0366334 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACTG AACACGTTAT TGAGTTCCTG CCGTTCCATG CAGGGCAGAA GAAAATTTAC 
CGTTCACCTG CAAAACGAAA AGTCATTCGC GCCGGGCGCC GCTTCGGTAA AACCACGATG
CTGGAGCAGG CTGGCGGAAA CTGGGCGGCT CGCCAGATGC GCGTAGGCTG GTTTGCTCCG
TCTTATAAAA TCCTGTTGCC GTCGTTTAAG ACCATCCGTG ACCTGTTAAA GCCGATCACG
ATTAGTTCCA GTAAGACCGA TTCGATTATT GAACTGATTG GCGGCGGTCT GGTTGAGTTC
TGGACGCTGG ATAATCCCGA TGCCGGGCGC TCCCGAAAAT ATCACAAAGT CATTATTGAT
GAGGGCAGTC TCGTCAAAAA GGGCATGAGG GATATCTGGG AACAGGCCAT TGAGCCGACG
CTGCTCGACT TTGACGGCGA TGCGGTGATG GCCGGTACGC CGAAAGGCGT TGATGACGAG
AATTTTTTCT ATCAGGCCTG TAATGATAAA TCGATGGGCT GGGAGGAACA TCATGCGCCG
ACTGCGGCTA ACCCGACAAT TAATCCGGCG GCGCTGGCCC GAATTATCGA CGGTCGCCCT
CCGCTGGTGG TTCAGCAGGA ATACAACGCT GAATTCGTGG ACTGGCGCGG GCAGAACTTT
TTCAAGCTCG ACTGGTTGCT GGAGAACGGC GCGCCTGTTG ATTATCCGTT TTCCTGCGAT
ACGGTTTATG GTGTCGTTGA CTGTGCGCAA AAGGGAAAAC TCCAGAACGA CGGATCCGCG
TGTATCTGGT TTGCGCTCGA TAACCTGCCG TCGCCACACC TTATCATTCT GGACTGGGAC
ATTATCCAGA TTGACGGGTA TTTCCTGAAA GACGTTGTGC CGCAGTGGGA AGGTAAAGCT
AAACACCTTA GCGAAATCTG CCGCGCCCGT ATGGGGACCA CAGGCCTGTT TATTGAGGAT
AAGGCAACCG GCATCACCCT GTTACAGCAG GACGCTAACG AGGGCTGGAA CGTCCACCCT
GTCGACAGTG AGTTAACGTC ACTTCCCAAA GAATCCCGCG CCATCAACAT TTCTGGTTAT
GTGGCGTCCG GGAAGGTACG CATTTCTAAA TACGCCTTTG ACAAAATCGT TGAGTACAAA
CAGTCGAAGA AAAACCATCT TCTGACGCAG GTACTCCAGT TCATCATTGG TGAAGAAAAC
CTGGACGACG ATCTGTTTGA CTGCTTTAAC TACGGCGTCG CGCTTGGTCT TGGTAACGGA
GAGGGGTTCT GA
 
Protein sequence
MATEHVIEFL PFHAGQKKIY RSPAKRKVIR AGRRFGKTTM LEQAGGNWAA RQMRVGWFAP 
SYKILLPSFK TIRDLLKPIT ISSSKTDSII ELIGGGLVEF WTLDNPDAGR SRKYHKVIID
EGSLVKKGMR DIWEQAIEPT LLDFDGDAVM AGTPKGVDDE NFFYQACNDK SMGWEEHHAP
TAANPTINPA ALARIIDGRP PLVVQQEYNA EFVDWRGQNF FKLDWLLENG APVDYPFSCD
TVYGVVDCAQ KGKLQNDGSA CIWFALDNLP SPHLIILDWD IIQIDGYFLK DVVPQWEGKA
KHLSEICRAR MGTTGLFIED KATGITLLQQ DANEGWNVHP VDSELTSLPK ESRAINISGY
VASGKVRISK YAFDKIVEYK QSKKNHLLTQ VLQFIIGEEN LDDDLFDCFN YGVALGLGNG
EGF