Gene Dtpsy_2107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtpsy_2107 
Symbol 
ID7383042 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidovorax ebreus TPSY 
KingdomBacteria 
Replicon accessionNC_011992 
Strand
Start bp2262557 
End bp2263600 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content62% 
IMG OID643655426 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002553562 
Protein GI222111298 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTATGA AATCCTTGGT TGTTGCATGT GTTCTGGCCG CTGCAGGCGC TGCAAGTGCG 
CAGGACCAGG TGGTCAACCT GTATTCCGCC CGCCACTACG CCACCGATGA AGCGCTGTAC
AGCGGCTTTA CCAAGGCCAC GGGCATCAAG ATCAATCGCG TGGATTCTGA TGACGCCGGG
ATCATGGCCC GTCTCAAGGC AGAAGGAGCC GCATCCCCTG CCGACGTGAT CTTGCTGGTG
GATGCCGCCC GCCTCTACCG TGGCGAGGCC GACGGCCTGT TCCTGCCCGT GCGCTCGAAG
GTGCTGGAGG ACGCCATTCC CGCGAACCTG CGCGCAACAC CGGTCGCCGA CGGTGGTATT
CCCTGGTTTG GCCTGTCCAC GCGGGCGCGT GTGGTTGTCT ACAACAAGAC CAAGGTCAAC
AAGGACGATG TGGACACCTA CGAAGAGTTG GGCGACCCCA AAAACAAAGG CAAGGTCTGC
ATCCGTTCCG GCTCGCACCC CTACAACCTG AGCCTGTTCG GCGCCGTGAT GGAACACGTG
GGCGAACAGA AAGCCGAAGC ATGGCTCAAG GGCGTGGTGA ACAACCTCGC CCGCGCGCCC
AAGGGGGGCG ACACCGATCA AATCAAGGCC GTCGCAGCCG GCGAATGCGA TATCGCCGTG
ACCAACAGCT ATTACCTCGC CCGCCTGATG CGCTCGGACA AGTCCGAAGA CAAGGCCGTG
GTGGACAAGG TGGCGGTGGT GTTCCCCAAC CAGCAATCGT GGGGCACGCA CATGAACATC
GCAGGTGGCG CAGTGGCCCG CCACACCAAG AACCAGGCCA ACGCCATCAA GTTCCTGGAA
TACCTGGCCA GCCCTGAGGC TCAGAACTAC TTTGCCAACG GCAACAACGA ATGGCCCGCC
GCCAAAGACG TGGATCCGGG CAACCCGGCC CTCAAGGCCA TGACGGGCGG CCAACCGTTC
AAGAGTGAAA CCATCCCCAT TGGCGCGGTC GGCGCCAATA CCGTCAAGGT GCAGCAGATG
CTGGACCGCG TCGGTTTCCG GTAA
 
Protein sequence
MPMKSLVVAC VLAAAGAASA QDQVVNLYSA RHYATDEALY SGFTKATGIK INRVDSDDAG 
IMARLKAEGA ASPADVILLV DAARLYRGEA DGLFLPVRSK VLEDAIPANL RATPVADGGI
PWFGLSTRAR VVVYNKTKVN KDDVDTYEEL GDPKNKGKVC IRSGSHPYNL SLFGAVMEHV
GEQKAEAWLK GVVNNLARAP KGGDTDQIKA VAAGECDIAV TNSYYLARLM RSDKSEDKAV
VDKVAVVFPN QQSWGTHMNI AGGAVARHTK NQANAIKFLE YLASPEAQNY FANGNNEWPA
AKDVDPGNPA LKAMTGGQPF KSETIPIGAV GANTVKVQQM LDRVGFR