Gene SeD_A1039 details


Gene Information

Locus tag: SeD_A1039
Symbol:
ID: 6871809
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 1038673
End bp: 1040433
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 53%
IMG OID: 642784224
Product: hypothetical protein
Protein accession: YP_002214898
Protein GI: 198245525
COG category: [S] Function unknown
COG ID: [COG1944] Uncharacterized conserved protein
TIGRFAM ID: [TIGR00702] uncharacterized domain
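
The coordinates and lengths listed above agree with each other, and a few lines of arithmetic can confirm this. The sketch below assumes that the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1038673, 1040433)
print(length)                  # 1761
print(protein_length(length))  # 586
```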


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.548133
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value: 0.203105
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCAAA CATTTATTCC CGGCAAAGAC GCCGCGCTGG AAGACTCCAT CGCCCGCTTC 
CAGCAAAAGT TACTCGACCT CGGCTTTCAC ATCGAAGAGG CCTCCTGGCT GAACCCGGTG
CCAAACGTCT GGTCAGTGCA TATTCGCGAT AAAGAGTGCG CGTTATGCTT TACCAACGGA
AAAGGCGCGA CCAAAAAAGC GGCGCTGGCC TCGGCGCTGG GCGAATACTT CGAGCGTCTG
TCAACCAACT ACTTCTTTGC TGATTTCTGG CTTGGCGAAA CGGTCGCCAA TGGGCCATTC
GTGCATTACC CGAACGAAAA GTGGTTCCCG CTGACTGAAA ATGACGACGT ACCGGAAGGC
TTGCTTGATG CCCGTCTGCG CGCGTTTTAC GATCCGGAAA ATGAACTCAC CGGAAGCCAG
TTAATTGATC TTCAGTCCGG CAATGAAGCT CGCGGCGTCT GCGGCCTGCC ATTTACCCGT
CAGTCCGATA ACCAGACCGT GTATATTCCG ATGAATATCA TCGGCAACCT GTACGTCTCT
AACGGAATGT CCGCCGGCAA TACGCGTAAT GAAGCCCGCG TTCAGGGACT GTCGGAAGTC
TTCGAGCGTT ATGTGAAAAA TCGCATCATT GCGGAAAGTA TCAGTCTGCC GGAGATTCCC
GCAGAGGTGA TGGCGCGTTA TCCGGCGGTA ATGGAGTCAA TCGCCACGCT GGAAGCCGAG
GGTTTCCCGA TTTTCGCCTA TGACGGCTCG CTGGGCGGTA AGTATCCGGT TATCTGCGTC
GTGCTGTTCA ACCCGGCTAA CGGTACCTGC TTTGCTTCTT TTGGCGCCCA TCCTGACTTT
GGCGTTGCGC TGGAGCGTAC AGTGACCGAG CTACTCCAGG GACGCGGTCT GAAAGATCTT
GATGTCTTCA CGCCACCAAC GTTCGATGAT GAAGAAGTCG CGGAGCACAC TAATCTGGAG
ACCCACTTCA TCGACTCCAG CGGCCTGATT TCCTGGGATC TGTTCAAACA GGACGCCGAT
TATCCGTTCA CGGACTGGAG TTTTTCCGGC ACTACCGAAG AAGAATTCGC CACGCTGATG
GCCATCTTTG CTGCTGAAGA TAAAGAAGTT TACATTGCCG ATTACGAGCA TCTCGGCGTA
TACGCCTGTC GTATTATCGT ACCGGGAATG TCTGATATTT ATCCTGCCGA AGATCTGTGG
CTGGCCAACA ACAATATGGG TAGCCATCTT CGTGAGACTC TGCTTTCGCT GCCCGGTAGC
GCCTGGAATA AAGAAGATTA TCTCAATCTG ATTGAACAAT TGGATGAAGA AGGTTTTGAC
GATTTCACCC GCGTGCGTGA ACTGTTGGGT CTGGCGACCG GAGCGGACAA TGGTTGGTAT
ACACTGCGCG TCGGCGAATT AAAAGCAATG TTAGCGTTAG CGGGCGGCGA TTTGGAGCAG
GCGCTAATCT GGACAGAATG GACGATGGAG TTCAATTCGT CGGTCTTTAG TCCGACACGC
GCAAACTATT ACCGTTGCCT GCAAACTCTG CTGCTCCTGT CGCAAGAAGA TGCGCGTCAG
CCACTGCAAT ATCTCAATGC TTTTATAAAA ATGTATGGCG CAGAGGCTGT AGAGGCCGCC
AGCGCCGCGC TTAGCGGTGA AGCGGCTTTT TATGGACTAC CGGCTGTCGA CCACGATCTA
CAAGCGTTCC CGGCGCATCA GTCCTTGTTA AAAGCGTATG ATAAATTACA GCGCGCGAAA
GCGGCATACT GGTCAAAATA A
 
Protein sequence
MTQTFIPGKD AALEDSIARF QQKLLDLGFH IEEASWLNPV PNVWSVHIRD KECALCFTNG 
KGATKKAALA SALGEYFERL STNYFFADFW LGETVANGPF VHYPNEKWFP LTENDDVPEG
LLDARLRAFY DPENELTGSQ LIDLQSGNEA RGVCGLPFTR QSDNQTVYIP MNIIGNLYVS
NGMSAGNTRN EARVQGLSEV FERYVKNRII AESISLPEIP AEVMARYPAV MESIATLEAE
GFPIFAYDGS LGGKYPVICV VLFNPANGTC FASFGAHPDF GVALERTVTE LLQGRGLKDL
DVFTPPTFDD EEVAEHTNLE THFIDSSGLI SWDLFKQDAD YPFTDWSFSG TTEEEFATLM
AIFAAEDKEV YIADYEHLGV YACRIIVPGM SDIYPAEDLW LANNNMGSHL RETLLSLPGS
AWNKEDYLNL IEQLDEEGFD DFTRVRELLG LATGADNGWY TLRVGELKAM LALAGGDLEQ
ALIWTEWTME FNSSVFSPTR ANYYRCLQTL LLLSQEDARQ PLQYLNAFIK MYGAEAVEAA
SAALSGEAAF YGLPAVDHDL QAFPAHQSLL KAYDKLQRAK AAYWSK
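
Both sequences are displayed in space-separated blocks of ten characters, so the whitespace has to be removed before any processing. As a minimal sketch, the code below strips the spacing, translates a nucleotide sequence, and computes GC content. It relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard genetic code (the two differ in permitted start codons, which this sketch does not model). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and ambiguous bases such as N are not handled.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing and line breaks, and uppercase the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence shown above.
first_line = "ATGACGCAAA CATTTATTCC CGGCAAAGAC GCCGCGCTGG AAGACTCCAT CGCCCGCTTC"
print(translate(first_line))  # MTQTFIPGKDAALEDSIARF
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value to compare against the GC content reported in the gene information.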