Gene Dtpsy_1988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtpsy_1988 
Symbol 
ID7382107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidovorax ebreus TPSY 
KingdomBacteria 
Replicon accessionNC_011992 
Strand
Start bp2124109 
End bp2125146 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content70% 
IMG OID643655306 
Productaminodeoxychorismate lyase 
Protein accessionYP_002553444 
Protein GI222111180 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCCCG GCTTCTTTCA TACAGGCTGC AGGGTTGTGC GTCGTTTCCT CGCATTGGTG 
TTGCTCATCG TGATCGCCGT GGGCGCTGTG GCCGCCTGGT GGCTGCAGGC GCCGTTGCCG
GTGCGCGCGG ACGTGCCCGC CGGCCAGCCG CTGGAGCTGG AGATCGAGCC TGGCACCACG
CCACGCAGCG TGGCCCGCGC GGTGGTGCGG TCGGGCATGG CCACGGATGC AGACGTGCTG
TTCCTGTGGT TCCGGCTGTC GGGCAAGGAC CGCGAGATCA AGGCCGGCAA CTACGAGATT
CCCCAGGGCA CCAGCCCCTA CGCGCTGCTG CAGAAGCTGG TGCGCGGCGA GGAGGCATTG
CGCGCCGTCA CGCTGGTGGA AGGCTGGACC TTTCGCCAGG TGCGCCAGGC GCTGGCGCGG
GCCGAGCAGC TCAAGCCCGA CAGCCAGGGC CTGAGCGACG CGGACATCAT GGAGCGCCTG
GGCCGCGCGG GCGTGCCCGC GGAGGGGCGC TTCTTTCCCG ACACCTACAC CTATGCCAAG
GGCAGCAGCG ACATTGCCGT GCTGCGTCGC GCGCTGCACG CCATGGACCG GCGTCTGGAC
GCGGCCTGGG CGCAGCGCGC GCCGGACACG CCGCTCCAAT CCGCCGACCA GGCGCTGATC
CTGGCGAGCA TCGTCGAGAA GGAAACCGGC CGCGCCGAAG ACCGCGCGCA GATCGCCGGC
GTGTTCAGCA ACCGCCTGCG CGTGGGCATG CTGCTGCAGA CCGACCCCAC GGTGATCTAC
GGCCTGGGTG AGAAGTTCGA CGGCAACCTG CGCCGCCGTG ACCTGACCGC CGACACCCCC
TACAACACCT ACACACGTGT GGGCCTGCCG CCCACGCCGA TTGCCATGCC CGGCAAGGCG
GCGCTGTTGG CCGCGGTGCA GCCCGCGCCC ACCAAGGCGT TGTACTTCGT GGCGCGCGGC
GACGGTTCCA GCCACTTCAG CAGCACGCTC CAAGACCACA ACCGTGCGGT GAACCGCTAC
CAACGCGGCC AGAAATGA
 
Protein sequence
MMPGFFHTGC RVVRRFLALV LLIVIAVGAV AAWWLQAPLP VRADVPAGQP LELEIEPGTT 
PRSVARAVVR SGMATDADVL FLWFRLSGKD REIKAGNYEI PQGTSPYALL QKLVRGEEAL
RAVTLVEGWT FRQVRQALAR AEQLKPDSQG LSDADIMERL GRAGVPAEGR FFPDTYTYAK
GSSDIAVLRR ALHAMDRRLD AAWAQRAPDT PLQSADQALI LASIVEKETG RAEDRAQIAG
VFSNRLRVGM LLQTDPTVIY GLGEKFDGNL RRRDLTADTP YNTYTRVGLP PTPIAMPGKA
ALLAAVQPAP TKALYFVARG DGSSHFSSTL QDHNRAVNRY QRGQK