Gene Saro_1100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1100 
Symbol 
ID3916396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1141884 
End bp1143149 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content68% 
IMG OID640443835 
Productthreonine dehydratase 
Protein accessionYP_496379 
Protein GI87199122 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1171] Threonine dehydratase 
TIGRFAM ID[TIGR01127] threonine dehydratase, medium form 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAATG CAGCGCTCAC GCCCGTAGTG CAGGGGGCCG CCGACGACGC GGACGCCCTG 
CTGACTCTCG CCGACGTTCG TGCGGCGGCG GAGCGCATTG CCGGCCAGGT GGTGCGCACC
CCGACCCTGC ACAGCAAGAC GCTGAGCGCG ATTACCGGCG CGAACATCTG GATCAAGTTC
GAGAACCTGC AGTTCACCGC CGCCTACAAG GAGCGAGGTG CGCTCAATGC CCTGCTGCTC
CTTTCGCAGG AACAGCGCGC GCGGGGCGTG ATCGCCGCGT CGGCGGGAAA CCACGCGCAG
GGGCTTTCCT ACCACGGAAC CCGCCTGGGC GTTCCCGTGA CCATCGTCAT GCCGCGCACG
ACGCCGACGG TGAAGATCAT GCAGACCGAG GCCGTGGGCG GCAAGGTCGT GCTCGAGGGC
GAGACCTTCG ACGAGGCCTA TGCCCATGCG CGCAAGCTGG AGCGCGAACT GGGACTGACG
TTCGTCCACC CGTTCGACGA GCGCAATGTC GCGGCCGGAC AGGGTACGGT CGCGCTCGAG
ATGCTCGAGG ATGCGCCCGA GATCGACATG CTGGCCGTTC CCATCGGTGG CGGTGGCCTG
CTTTCGGGCA TGGGCACGGC GGCGCGCGGG ATCAAGCCGG AGATCGGCCT GATCGGCGTG
CAGGCACAGC TCTTCCCGTC GATGTTCGCG CGGCTCAAGC ACCTCGATTT GCCATGCGGC
GGCGATACGC TGGCCGAGGG CATCGCGGTG AAGGAGCCGG GGGCGTTCAC CTCCGCCGTG
CTGCGCGATC TGGTCGACGA TGTGGTGCTG GTGAACGAAG CCGCGCTGGA ATCGGCCGTG
GCGCTGCTGC TCCAGATCGA GAAGACCGTG GTCGAAGGCG CGGGCGCCGC GGGGCTGGCG
GCGGTGATGC AGAACCGGGA GCTGTTCGCG GGCCGCAACG TGGGCGTCGT GCTGACGGGC
GCGAACATCG ATACGCGCCT GCTGGCCAAC GTGCTGCTGC GCGATCTTGC ACGGTCGGGG
CGCCTCGGCC GCCTGCGCAT CACATTGCAG GACCGTCCTG GCGCGCTGTT CAAGGTGGTC
GAGGAGTTCA ACCGTCACCA GGTGAACATC CTTGAAGTTT GGCACCAGCG CATCTTCACT
TCGCTGCCGG CCAAGGGCCT GACCGCCGAG ATCGAGTGCG AGGCGCGCGA TCGCGAGCAG
ATCGACCGGC TCGTCGCCGG GCTGCGCGGC AAGGGCTACG ACGTCGAGCA GGTCGAACTG
GGGTAG
 
Protein sequence
MENAALTPVV QGAADDADAL LTLADVRAAA ERIAGQVVRT PTLHSKTLSA ITGANIWIKF 
ENLQFTAAYK ERGALNALLL LSQEQRARGV IAASAGNHAQ GLSYHGTRLG VPVTIVMPRT
TPTVKIMQTE AVGGKVVLEG ETFDEAYAHA RKLERELGLT FVHPFDERNV AAGQGTVALE
MLEDAPEIDM LAVPIGGGGL LSGMGTAARG IKPEIGLIGV QAQLFPSMFA RLKHLDLPCG
GDTLAEGIAV KEPGAFTSAV LRDLVDDVVL VNEAALESAV ALLLQIEKTV VEGAGAAGLA
AVMQNRELFA GRNVGVVLTG ANIDTRLLAN VLLRDLARSG RLGRLRITLQ DRPGALFKVV
EEFNRHQVNI LEVWHQRIFT SLPAKGLTAE IECEARDREQ IDRLVAGLRG KGYDVEQVEL
G