Gene Dtpsy_3541 details


Gene Information

Locus tag: Dtpsy_3541
Symbol:
ID: 7384667
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidovorax ebreus TPSY
Kingdom: Bacteria
Replicon accession: NC_011992
Strand:
Start bp: 3791916
End bp: 3793502
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table: 11
GC content: 65%
IMG OID: 643656858
Product: extracellular solute-binding protein family 5
Protein accession: YP_002554964
Protein GI: 222112700
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
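
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch illustrates. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3791916, 3793502)
print(length)                  # 1587
print(protein_length(length))  # 528
```

Under these assumptions the computed values agree with the Gene Length (1587 bp) and Protein Length (528 aa) fields of this record.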


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTCAAC GTACCACCTT CAAACTGGGC TTGCTGGGGG CCGCCGTGCT CGTGGCCATG 
GGCACCGCCC ACGCCGCCAC CATGCGCTGG GCCGGCGCCA ACGACATCCT CACCGTGGAC
CCCCACGCGC AGAACCACCA GACCACGCAC GCCTTCCTGC AGCAGGTGTA CGAGAGCCTG
GTGCGCTATG ACGACAAGTA CCAGATCGAG CCCGCGCTGG CCACCAAGTG GACGCAGGTC
TCCCCCACAC AGGTGCGTTT TGAACTGCGC AAGGGCGTGA AATTCCATGA TGGTGCGCCA
TTCACGGCCG ATGACGTGGT GTTCTCGCTC ACGCGTGCCG GCACGGCGCC GTCCAACATG
ATGTCCGCGG TGCAGAGCGT CAAGGAAGTC AATAAGGTGG ACGAGCACAC GGTGGACGTG
TTCCTCAAGG GCCCCAATCC CATCCTGCTG CGCGAGCTGA CCGAGGCGCG CATCATGAAC
AAGGCCTGGG CCGAGAAGCA CAACTCTGTG AAGTCGCAAG ATTACGCGGG CAAGGAAGAG
AACTACGCCT CGCGCAACGC CAACGGCACG GGCCCCTTCG TCATGGTGGG CTGGCAGCCG
GACGTGAAGG TCACGCTCAA GAAGAACCCC AACTGGTGGG ACAAGCCCAA GGGCAACATC
GACGAGGTGG TGTTCACCCC CATCAAGTCG GCCGCCACGC GCTCGGCCGC GCTGATCTCG
GGGCAGGTGG ACTTCGTGGT CGATCCGCCG CCGCAGGACC TGGCGCGCAT GAAGGCCAAC
CCCGACATCA AGCTCATCGA AGGGGCCGAG AACCGCACCA TCTACCTGGG CCTGGACCAG
TTCCGCGACG AGCTGCCCGG CGCCGGCACG GCCGGCAAGA ACCCGCTCAA GGACAAGCGC
GTGCGCCAGG CGCTGTACCA GGCCATCGAC TCTGCGGGCC TGCACAGCCG CACCATGCGC
AACCTGTCGG TGCCTGCGGG CACCATGATC GCTCCCATGG TGCATGGCTG GACCAAGCAA
CTGAACGAGC GTGCCGCCAA GTACGACGTG GAGGCCGCCA AGAAGCTGCT GGCCGACGCC
GGCTACCCGA ACGGCTTCTC GCTCAAGCTC GACTGCCCGA ATGACCGGTA CGTGAATGAC
GAGGCCATCT GCCAGGCCGT CACCGCCATG TGGACACGCA TCGGCGTCAA GACCAACCTG
CAAACCGCGC CCATGGCGCA GTTCGTCACC CGCGTGATGA ACAACGACGT GAGCGCCTAC
CTGTTCGGCT GGGGCGTGGC CACGTTCGAC GCGCTGTATT CGCTGGACTC GCTGATGTCC
ACCAAGGACG GCAAGACCTC GGCTGGCGTC TACAACGGCG GGCGCTTTTC CGACGCCAAG
CTCGACGGCA TGATCCAGCA GATCAAGGTC GAGATGGATG CCCCCAAGCG CGACGCGCTG
ATCAACGACG CGCTCAAGCT GGTCAAGGAC GAGTACTACT ACCTGCCGCT GCACCACCAG
ATCCGCCCCT GGGCGATGCG CAAGAACGTG GACACGCCGC ATCGCGCGGA CGATCGCCCC
ATGCCCGCGT GGACGACCAT CAAGTGA
 
Protein sequence
MIQRTTFKLG LLGAAVLVAM GTAHAATMRW AGANDILTVD PHAQNHQTTH AFLQQVYESL 
VRYDDKYQIE PALATKWTQV SPTQVRFELR KGVKFHDGAP FTADDVVFSL TRAGTAPSNM
MSAVQSVKEV NKVDEHTVDV FLKGPNPILL RELTEARIMN KAWAEKHNSV KSQDYAGKEE
NYASRNANGT GPFVMVGWQP DVKVTLKKNP NWWDKPKGNI DEVVFTPIKS AATRSAALIS
GQVDFVVDPP PQDLARMKAN PDIKLIEGAE NRTIYLGLDQ FRDELPGAGT AGKNPLKDKR
VRQALYQAID SAGLHSRTMR NLSVPAGTMI APMVHGWTKQ LNERAAKYDV EAAKKLLADA
GYPNGFSLKL DCPNDRYVND EAICQAVTAM WTRIGVKTNL QTAPMAQFVT RVMNNDVSAY
LFGWGVATFD ALYSLDSLMS TKDGKTSAGV YNGGRFSDAK LDGMIQQIKV EMDAPKRDAL
INDALKLVKD EYYYLPLHHQ IRPWAMRKNV DTPHRADDRP MPAWTTIK
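
The sequences above are laid out in space-separated blocks of ten characters. As a rough sketch of how such a listing might be processed, the code below strips the block spacing, computes GC content, and checks for a start and stop codon. The helper names are hypothetical, and the GC calculation is the usual (G + C) / length definition, which is assumed rather than confirmed to be the one behind the GC content field. Only the final row of the gene sequence is used as demo input; paste the full listing to work with the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def has_start_and_stop(seq: str) -> bool:
    """True if the sequence begins with ATG and ends with TAA, TAG, or TGA."""
    return seq.startswith("ATG") and seq[-3:] in {"TAA", "TAG", "TGA"}


# Demo input: the last row of the gene sequence listing only.
last_row = clean_sequence("ATGCCCGCGT GGACGACCAT CAAGTGA")
print(len(last_row))                 # 27
print(round(gc_percent(last_row), 2))
print(has_start_and_stop(last_row))  # True
```

The same `clean_sequence` helper works for the protein listing, since it only removes whitespace.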