Gene Daci_4426 details

Gene Information

Locus tag: Daci_4426
Symbol:
ID: 5750014
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Delftia acidovorans SPH-1
Kingdom: Bacteria
Replicon accession: NC_010002
Strand:
Start bp: 4846995
End bp: 4848665
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 66%
IMG OID: 641299528
Product: extracellular solute-binding protein
Protein accession: YP_001565441
Protein GI: 160899859
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
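
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4846995
END_BP = 4848665


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1671
print(protein_length(length_bp))  # 556
```

Under these assumptions both results agree with the Gene Length (1671 bp) and Protein Length (556 aa) fields.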


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.169116
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.480332
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCAAA GACGCTTCCT GCCTCGCCTT TCCCTGCTGC GCACCCACTT GCGCACCCTG 
CGAATTTCCG CACTGCTTCC CGCCCTGGCC GCAGCCCTGC TGTGCGCACC GCCCGCCCAG
GCCGAAACCC TGCGCTGGGC GCGCTCCACC GACGCATCCA CGCTGGACCC GCATGCGCTG
AACAACGGCC CCAACCACAA CCTGCTGCAT CAGATCTACG AGCCGCTGAT CATCCGCACG
GCCGATGGCA AGCTGCTGCC CACCCTGGCC ACGTCCTGGG CGCTGACTGC CGACCCCTCG
GTCTGGGAGT TCAAGCTGCG CAAGGGCGTG AAGTTCCATG ACGGCAGCCT GTTCACGGCC
GACGACGTGC TGTTCTCGCT GCGCCGCGCG CGCTCGGCCA CCTCGGACAT GCGCTCGCTG
CTGACCTCGA TCACCGACGT GACCAAGGTC GATGCCTTCA CCGTGCACAT CAGGACCAAC
GGACCCAACC CGCTGCTGCC GGCCAGCCTC ATCAACATCC AGATCTTGAG CGCCGCCTGG
GCCAAGGCCC ATGGTGCCGA GCAGCCGCAG AACGCGCTGG CCAAGGAAGA GAACTACGCC
ACGCGCAACG CCAACGGCAC GGGGCCCTAC GTCATCGCCT CGCGCGAGCA GGACACGCGC
ACCGTGCTGC GGCAGTTTCC CGGCTACTGG GGCAAGGGGC TGTTCCCGCT GGAAATCGAC
GAACTGGTCT ACCTGCCCAT CAAGTCCCAG GCCACGCGCG TGGCGGCGCT GCTGTCGGGC
GAGGTGGACT TCGTGCAGGA CCTGCCCATC CAGGACATTG CGCGCCTGAG CGCCGACCCG
CGCTTTCGCA TCAACCAGGC CGCCGAGAAC CGCACCATCT TCCTGGGTCT GAACGTGGGC
GCCGCCCCGC TGTCGCATTC CGATGTGAAG GACAAGAACC CCCTGGCCGA CCTGCGCGTG
CGCCAGGCCT TCCAGCTGGC CATCGACCGG CAGGCCATAC AGCGCGCCGT GATGCGCGGC
CTGTCCGTGC CCACCAACAT CATTGCGCCG CCCTTTGTGC ACGGCTACGA GAAATCCTTT
GGCGCCGTGG GCAAGGCCGA CCTTGTCCAG GCCAGGAAGC TGCTGGCCGA GGCCGGCTAT
CCCAACGGCT TCGGCATCAC CCTGCATTGC ACGAACGACC GTTATCTGAA CGACGAGGCC
ATCTGCCAGG CCATCGCCGG CTTTCTGGGC CGCATCGGCG TGAAGACCGC CGTGTCGTCG
CGGCCGCTGG CCATCCAGAC GGCGGCCATC AACAACCAGG AGACGGATTT CTACCTCTAC
GGCTGGGGCG TGCCCACCTA TGACTCGGCC TATGTCTTCG ACTACCTGGT GCACACGCGC
GGCAAGAACG GCCGGGGCAA CACCAATGCC ACGCGCTACA GCAATGCCGA GCTGGACAGC
CAGATCGTCT CCCTGGCCTC CGAGGGCGAT GCGCGCAAGC GCGATGCCAC CATCCACTCC
ATCTGGAGCA CGGTGCAAAA GGAGCTGATC TACCTGCCGC TGCACGACCA GATCCAGACC
TATGCCATGG TGCGCAAGTT CGACATCCCG GTGAATCCGT CGAACACGCC TTACTTCAAG
CTGTTCAAGC AGCCCGGTGC GCGCCAGGCT GCGGTGGCCG GCGCGCAGTA G
 
Protein sequence
MTQRRFLPRL SLLRTHLRTL RISALLPALA AALLCAPPAQ AETLRWARST DASTLDPHAL 
NNGPNHNLLH QIYEPLIIRT ADGKLLPTLA TSWALTADPS VWEFKLRKGV KFHDGSLFTA
DDVLFSLRRA RSATSDMRSL LTSITDVTKV DAFTVHIRTN GPNPLLPASL INIQILSAAW
AKAHGAEQPQ NALAKEENYA TRNANGTGPY VIASREQDTR TVLRQFPGYW GKGLFPLEID
ELVYLPIKSQ ATRVAALLSG EVDFVQDLPI QDIARLSADP RFRINQAAEN RTIFLGLNVG
AAPLSHSDVK DKNPLADLRV RQAFQLAIDR QAIQRAVMRG LSVPTNIIAP PFVHGYEKSF
GAVGKADLVQ ARKLLAEAGY PNGFGITLHC TNDRYLNDEA ICQAIAGFLG RIGVKTAVSS
RPLAIQTAAI NNQETDFYLY GWGVPTYDSA YVFDYLVHTR GKNGRGNTNA TRYSNAELDS
QIVSLASEGD ARKRDATIHS IWSTVQKELI YLPLHDQIQT YAMVRKFDIP VNPSNTPYFK
LFKQPGARQA AVAGAQ
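
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the method used by the database: it assumes that translation table 11 assigns amino acids to codons exactly as the standard genetic code does (the two differ in permitted start codons), and it hard-codes only the first and last lines of the gene sequence as samples. The helper names `clean`, `translate`, and `gc_percent` are illustrative.

```python
# Codon table built in TCAG order; amino acid assignments of the standard code,
# assumed here to match translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from this record.
FIRST_LINE = "ATGACCCAAA GACGCTTCCT GCCTCGCCTT TCCCTGCTGC GCACCCACTT GCGCACCCTG"
LAST_LINE = "CTGTTCAAGC AGCCCGGTGC GCGCCAGGCT GCGGTGGCCG GCGCGCAGTA G"

print(translate(FIRST_LINE))  # MTQRRFLPRLSLLRTHLRTL
print(translate(LAST_LINE))   # LFKQPGARQAAVAGAQ
```

The last line can be translated on its own because the 1620 bases before it are a multiple of three, so it starts in frame. Passing the complete gene sequence to `gc_percent` gives a value to compare with the GC content field.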