Gene Daci_2234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaci_2234 
Symbol 
ID5747797 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDelftia acidovorans SPH-1 
KingdomBacteria 
Replicon accessionNC_010002 
Strand
Start bp2451435 
End bp2453543 
Gene Length2109 bp 
Protein Length702 aa 
Translation table11 
GC content69% 
IMG OID641297318 
ProductEAL domain-containing protein 
Protein accessionYP_001563259 
Protein GI160897677 
COG category[G] Carbohydrate transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG1455] Phosphotransferase system cellobiose-specific component IIC
[COG2200] FOG: EAL domain 
TIGRFAM ID[TIGR00410] PTS system, lactose/cellobiose family IIC component 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.0455716 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCAACA AGACGTCACG GGCCGGCTTT CCGGGCGTGC AACTGTTCCT GACACAGCGC 
CTGGCCAGGC TGGCCGGCGC CAACTCCATG CGGGCCATAC GCGAGGGCCT GCTCTGGCTG
GTTCCCTGTC TGCTGGTCTC GGCCCTGTTC CTCATCCTGT CGGCGCTGGC GCAGATGTCC
GGCCAGCCCG AACCCGTGGT GCGGGTGCTT GCGGGCCTGC ACGCGGAGAT CGGCAGCATC
CTGCCGCTGC TGGTGGCCGC GTCCATCGGC TACATGCTCT CAATACGCCA CCGGCTGCCG
CGCCTGCCGG TGACCTTTCT GTGCTTCGCG CATGTGCAGA TGGCCATTTA TCTGCTGAGC
GAGCATCCGC GCGCGGCGGC CACCCTGGTG CTGTTCATCT CCATCGCCTC CCCCCTGATC
ACCGTGCCCC TGATGGCGCG CCTGAGCCGG CTGCGCTGGA CGCGCATCGC GCGCACCGGC
TTCGTGAGCG AGAACGTGCG CGAGGTCATG AACCTCGTCG TTCCCGGCGC CCTTGTCGCC
GTGCTGCTCG TGCTGCTGCT GCTGGGCCTG CAACAGGTGC TGCCGGATAT CACGCGGGCC
GAACTGCCGC TGGCCATCGC GGGCCCCGAG AACCCCTACC GCAGCGGGCT CACGCTGGCA
CTGCTCAATT CCGTGCTCTG GTTCTTCGGC GTCCAGGGGT ACTACGCCAT GCAGCCCTTC
TTCCAGGTGC TGGACCAGGC GGTGGTGGCC AATGCCGCCG CGATGGCGCA GGGGCTGCAG
GCGCCCTGGG CGCTCAGCGG CGGGCTGATG GGCTCCTTCG TCTTCATCGG CGGCTCCGGT
GCAACGCTGT CGCTGGCGCT GGCCGTGCTG CTGTTTTGCC GGGGGCGGGG CCTGCGCGTG
CTGGCGCTGG CGGCGCTGCC CATCTCCTTG CTCAACGTCA ACGAGATCCT GCTGTTCGGG
CTGCCGATCA TCCTGAACCT GCGCCTGCTC GTGCCGTTTC TCGCGGTACC CGCCATCAAC
CTGGTCGTGG CCGTCACGGT GGTGCAGGCC GGCTGGGTGG CACCGGCCTC CATGGTGCTG
CCGCTGACTG CGCCCGTGGT CTTCAATGCC TATGTGAGCA CGGGCGGGGA CATGGCAGCC
GTGGTACTGC AACTGGCATT GACGGCCCTG GGAGCGCTGA TCTATGCGCC CTACGTGCGG
GCCATCGACC GGCTGGGCCA GGACGACGGT GCCATCGTGC TGCACGCGCT GGACACCACG
TTCTCGCGCC TGCCCGAAGA GGCTGGCCTG ATCGTGCGCG ATCCGCTGGT GCAGGCGCAC
CAGGCGCAGG CCCGGCGCGA GACCATGCTG GCGCATATCC GTCGCATCAG CGAGTACGAG
TTCCACCTGG AATTCCAGCC CCAGGTCTCG CACCGCAGCG GCCTGTGCCT GGGCTGCGAG
GCCCTGCTGC GCGCGCGGGA CGCGCAGGGC CGCTTGCAGC AGCCCGGCAG TTTCCTGCGC
TGGCTGGCCG ATGCGGGGCT GATGCGCGAG GTGGACCTGT GGGTGGCGGG CGCTGCCGTG
CGCCAGAGCC GGCGCTGGCG CCAGGAGGGC TTTGCCATGC CGATCAGCAT CAATGTGTCG
GGTGCGACGT TGACATCGCC CGAATACTGC TCGCGCCTGA TGCTGCTGCT GGCCCAGGCC
AGAGGGCAGG TCGGCGTGGA GATCACGGAG GAAGAAATGG TGGGCGACGT GGAGGCGATC
CGCCGCGCCA TAGGGCAGAT CCATGCGCTG GGCGCCAAGG TGTCCATCGA TGACTTCGGC
ACCGGCTTCT CGTCCATGAG CTACCTGCAC CAGTTCGACG TGGATGCCAT CAAGATCGAC
CGCAGCTTCG TCGTGGCCAG CGGCCATGCC AAGGGCGAGC TGGTGCTGGA CGGCCTGCTG
CGCTTTTGCG AGGCGCTGCA GCTGAACGTG GTGGCCGAGG GCGTGGAGAC CGAGGCGCAG
TTGCAGGCCC TGGGCTTCGA GGGCGAGCTG CTGGTGCAGG GCTGGTACTT CAGCCGCGCC
CTGCCGGGCG AGAAACTGCC GCAGTTCGCC AGGGACTGCG CCGCCAGGGC AGGCGGCGGG
ACGGTCTAG
 
Protein sequence
MGNKTSRAGF PGVQLFLTQR LARLAGANSM RAIREGLLWL VPCLLVSALF LILSALAQMS 
GQPEPVVRVL AGLHAEIGSI LPLLVAASIG YMLSIRHRLP RLPVTFLCFA HVQMAIYLLS
EHPRAAATLV LFISIASPLI TVPLMARLSR LRWTRIARTG FVSENVREVM NLVVPGALVA
VLLVLLLLGL QQVLPDITRA ELPLAIAGPE NPYRSGLTLA LLNSVLWFFG VQGYYAMQPF
FQVLDQAVVA NAAAMAQGLQ APWALSGGLM GSFVFIGGSG ATLSLALAVL LFCRGRGLRV
LALAALPISL LNVNEILLFG LPIILNLRLL VPFLAVPAIN LVVAVTVVQA GWVAPASMVL
PLTAPVVFNA YVSTGGDMAA VVLQLALTAL GALIYAPYVR AIDRLGQDDG AIVLHALDTT
FSRLPEEAGL IVRDPLVQAH QAQARRETML AHIRRISEYE FHLEFQPQVS HRSGLCLGCE
ALLRARDAQG RLQQPGSFLR WLADAGLMRE VDLWVAGAAV RQSRRWRQEG FAMPISINVS
GATLTSPEYC SRLMLLLAQA RGQVGVEITE EEMVGDVEAI RRAIGQIHAL GAKVSIDDFG
TGFSSMSYLH QFDVDAIKID RSFVVASGHA KGELVLDGLL RFCEALQLNV VAEGVETEAQ
LQALGFEGEL LVQGWYFSRA LPGEKLPQFA RDCAARAGGG TV