Gene Dtpsy_3049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtpsy_3049 
Symbol 
ID7383266 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidovorax ebreus TPSY 
KingdomBacteria 
Replicon accessionNC_011992 
Strand
Start bp3255164 
End bp3256675 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content69% 
IMG OID643656359 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002554482 
Protein GI222112218 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGCGC AGCCTCCCGC TTCCCTCACC CGCCGTGCCT GCCTGGCCCA GGCCCTCGCG 
CTGTGCGCCG CGCCGGCCGC CGCACAGGTC TCACCGCGTG CCGCGGAACG CTCCGTGGTG
ATCAACCTCT CCCTGGAGCC CGACAGCCTG GACCCCACCA TGGCCGCTGC CGCCGCCGTG
GGCGAGGTGA CGCACTACAA CGTGCTGGAG GGGCTGACGC AGATCACGGA AAACGGCGCG
GTACAGCCAC TGCTGGCCGA GTCCTGGAAC ACCGACCCGA ACGGCCTCGC CTGCACCTTC
CGGCTGCGCC AGGGGGTGCG CTTTCACGAC GGCAGCGCGC TCAACGCGGC GGCCGTGCGT
TTCAGCTTCG AGCGTGCCGT CGCGCCCGGC TCCACCAACA AGTCCCGCAA GGCGCTCTTC
GACAACATCG CCACGGTGGT CACGCCCGAT GCGCACACCG TCACGCTCAC GCTGCACCAC
CCCGATCCGC ACCTGCTCTT CCGCCTGGGC GAGGGGCCGG CCGTGATCCT GCACCCGGAC
ACGGCCGCAC AGGCGGCCAG CGCCCCGGTG GGCACGGGCC CCTACCGCGT ACAGCGCTGG
GAGCGCGGCC AACGCATCAC GCTGGTCAAG GCCGAGTCCC ACCGCCATGC AGCCCAGGTA
CGCATGGAGC GTGCCGTGTA CCGCTTTCTG CACGACCTGG AGGCGCAGGA CAACGCCCTG
CGCGCGGGCG AGATCGACGT GCTGTTCAAC TTCGCCACGC AGACGGTGCG GCGCTTTCAG
GACAACAGCC ATTACCAGGT GTTGATCGGG GCCTCCGGCG GCAAGGGGAT GCTGGCGTTG
AACCACCGCA GCAAACCCCT GGACGACGTG CGCGTGCGCC GCGCCATCAC CCACGCCATC
GACCGCGAGG GCTTTATCCG CACCGCGCTG GATGGGCGCG GCGTGGTGAT CGGCAGCCAC
TTCAGTCCCA CGGACGCGGG TTACGTGCAC CTGGCCAGCC TACACCCCTA CGACCCCGCC
CGCGCACGCG CCCTGCTGAA ACAGGCCGGG GTGCAGACGC CGCTGCGGCT GGAACTGGCC
CTGCCGCCCG CACCGTACGC ACACGTGGGC GGGCCATTGA TCGCCCGCGA CCTGGCCCAG
GTGGGCATCG AGGCCAACAT CCAGCAGCTG AGCTGGCAGC AGTGGCTGCA GGGGCCGTTC
AAGGGCCAGT TCGACATGAC CCTCATCAAT CATGTGGAGC CGCTGGACTA CCGCATCTAC
ACCGACCCGG GCTACTACTT CGGCTACGAC AGCCCGGCCT TTCGCGCCCT GGCGGAGCGC
CACGCCAGCG CCACCAACGC GCGCGAGCGC CAGATGCTGT TCTCCCAGAT GCAGCGGCAC
TTGGCGCAGG ATGCCGTCAA CGCCTGGATC TTTGCACCGC AGATCGGCAC CGTGGTGCGC
AAGGGGCTGC GCGGCACGTG GATGAACTAC CCCATCTTTG CGCACAACAT CGCCGGCATG
TGGTGGGAAT GA
 
Protein sequence
MPAQPPASLT RRACLAQALA LCAAPAAAQV SPRAAERSVV INLSLEPDSL DPTMAAAAAV 
GEVTHYNVLE GLTQITENGA VQPLLAESWN TDPNGLACTF RLRQGVRFHD GSALNAAAVR
FSFERAVAPG STNKSRKALF DNIATVVTPD AHTVTLTLHH PDPHLLFRLG EGPAVILHPD
TAAQAASAPV GTGPYRVQRW ERGQRITLVK AESHRHAAQV RMERAVYRFL HDLEAQDNAL
RAGEIDVLFN FATQTVRRFQ DNSHYQVLIG ASGGKGMLAL NHRSKPLDDV RVRRAITHAI
DREGFIRTAL DGRGVVIGSH FSPTDAGYVH LASLHPYDPA RARALLKQAG VQTPLRLELA
LPPAPYAHVG GPLIARDLAQ VGIEANIQQL SWQQWLQGPF KGQFDMTLIN HVEPLDYRIY
TDPGYYFGYD SPAFRALAER HASATNARER QMLFSQMQRH LAQDAVNAWI FAPQIGTVVR
KGLRGTWMNY PIFAHNIAGM WWE