Gene SeD_A1001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1001 
SymbolltaE 
ID6872584 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp990477 
End bp991478 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content60% 
IMG OID642784186 
ProductL-threonine aldolase 
Protein accessionYP_002214861 
Protein GI198243778 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2008] Threonine aldolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.13566 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones101 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGATT TACGCAGTGA TACCGTTACC CGACCGGGTC GCGCCATGCT CGAGGCGATG 
ATGACCGCCC CGATCGGGGA CGACGTATAC GGCGATGACC CTACTGTTAA CGCCCTTCAG
CGCTACGCCG CCGACCTTTC CGGTAAAGAA GCGGCGCTTT TTTTACCCAC CGGCACCCAG
GCCAATCTGG TCGCGCTGCT TAGCCATTGT GAACGCGGCG AAGAGTATAT CGTCGGTCAG
GGCGCGCATA ATTATCTCTA TGAAGCTGGC GGCGCGGCGG TGCTCGGCAG CATTCAGCCG
CAGCCCATCG ACGCCGCCGC GGACGGTACG CTGCCGCTGG AGAACGTGGC GGCGAAGATT
AAAGCGGATG ACATCCACTT CGCGCGTACG CGCTTGCTCA GTCTGGAAAA TACGCATAAC
GGGAAAGTGC TGCCGCGCGC GTATCTGAAA GACGCCTGGA CGTTTACCCG CGAACGTGGG
CTGGCGCTGC ACGTTGACGG CGCCCGAATT TTTAACGCGG TGGTTGCCTA CGGCTGTGAG
TTAAAAGAGA TTACGCAGTA TTGCGACTCT TTTACCATCT GCCTGTCAAA AGGTCTCGGA
ACGCCGGTCG GTTCGCTGCT GGTCGGTAAC CGCGACTACA TTAAACGCGC GACACGCTGG
CGTAAAATGG TCGGCGGCGG AATGCGTCAG GCCGGGATTC TGGCAGCGGC CGGACTGTAT
GCGCTGAAGC ATAACGTGGC GCGTCTGCAA GAGGATCATG ATAACGCCGC CTGGCTGGCG
CAGCAGCTTC GCGAAGCGGG CGCGGAGGTC ATGCGCCACG AAACGAATAT GCTGTTTGTT
CGCGTTGGCG AAGCACAGGC CGCCGCGCTT GGCGACTATT TGCGGGAACG GAATATCCTG
ATTAACGCCG CGCCGATTGT GCGTCTGGTG ACGCATCTGG ATGTCTCTCG CGAACAGCTT
ACCGACGTCG TCGCCCACTG GCGCGCCTTT TTAGCCCGCT AA
 
Protein sequence
MIDLRSDTVT RPGRAMLEAM MTAPIGDDVY GDDPTVNALQ RYAADLSGKE AALFLPTGTQ 
ANLVALLSHC ERGEEYIVGQ GAHNYLYEAG GAAVLGSIQP QPIDAAADGT LPLENVAAKI
KADDIHFART RLLSLENTHN GKVLPRAYLK DAWTFTRERG LALHVDGARI FNAVVAYGCE
LKEITQYCDS FTICLSKGLG TPVGSLLVGN RDYIKRATRW RKMVGGGMRQ AGILAAAGLY
ALKHNVARLQ EDHDNAAWLA QQLREAGAEV MRHETNMLFV RVGEAQAAAL GDYLRERNIL
INAAPIVRLV THLDVSREQL TDVVAHWRAF LAR