Gene Daci_4552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaci_4552 
Symbol 
ID5750140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDelftia acidovorans SPH-1 
KingdomBacteria 
Replicon accessionNC_010002 
Strand
Start bp5007426 
End bp5008535 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content70% 
IMG OID641299653 
Productputative hemagglutinin-related protein 
Protein accessionYP_001565566 
Protein GI160899984 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2706] 3-carboxymuconate cyclase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.846091 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.748883 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCT CAGCGTCGCG CCCCCTGTAC GCCTACGTGG GTTCGCGCAC CACGCGCGAG 
CGCGACGCAC GTGGCGAGGG CATCACCGTC TATCGCGTGG ACGAAGCCAC GGGCGGCCTG
CAGCATCTGC AGACCGTGGA CGGGCTTTCC AACCCCTCGT TCCTGGCGCT CGATGCAGCC
GGCACCCGGC TCTACACCGT GCATGGCGAT GGCCATGAGG TCAGCGTCTT CGCACGCGAT
GCCGCAACGG GCCGCCTGGC GTTGCTGCAG ACCCGGGACT GCGGCGGGCG CAACCCGGTG
CACCTGGCCA TCGCCCCGGG TGGACGGCAG CTGGTGGTCT CCGACCATCT GGGCGAGCCG
GCTGCCAGCG ATGGCCACCA GGGGTATGGC GGGCCAGGCG GTACGCTGGC CGTGATGTCG
ATCGCACCGG ACGGGCGCCT GGGCTCCGTG CAGCAGCGGC TTGCGCTACC GGGCCAACCC
GGGCCGCATC GCAAGGAGCA GCCGTTTGCC AAGCCGCACT TCAACCCCTT CTCGCCGGAC
GGGCGCTTCG TGCTGGTGCC TGACAAGGGG CAGGATCGGA TCTTCATCTT CGCCTTCGAG
CACGGGCGGC TGGCGCCTGC GCCCCAGCCC TGGCTGGACT GCCGCGAAGG CTCGGGCCCG
CGGCATATGG CCTTCCATCC TGCGCTGGCC TGCGCCTATG TGGTCAACGA ACTGGACAAC
ACCGTGCTCA CCTGCCGCTT CGATGCGGCC ACGGGGGCGC TGCAGGGCTT GCAGATCCTG
TCCACCCTGC CGGAGCGCTT TGTGGGCAAC AGCCGGGCGG CGGGCATCGA GGTCTTGCGC
GACGGCCGGC AGGTGCTGGT GTCCAATCGC GGCGCTGACG GCATTGCGGT CTTCGATGTC
GATCCTTTGA CGGGGCTGCT GCACGCCAGT GGCGGCTTCG CCTCGGGCGG GCGCACGCCG
CGCTTTTTCA CGTCCTCGCC CGACGGGCGC CTGCTCTATG TGCTCAACGA GGACAGCGAC
AGCATCGTCT GCCATGCCCC GGATGACGCC TGGCGCCCGC TGGCCAGCAC CCACTGCGCC
AGCCCGGTGT GCATGGTGTT CGCGCGGTAG
 
Protein sequence
MSTSASRPLY AYVGSRTTRE RDARGEGITV YRVDEATGGL QHLQTVDGLS NPSFLALDAA 
GTRLYTVHGD GHEVSVFARD AATGRLALLQ TRDCGGRNPV HLAIAPGGRQ LVVSDHLGEP
AASDGHQGYG GPGGTLAVMS IAPDGRLGSV QQRLALPGQP GPHRKEQPFA KPHFNPFSPD
GRFVLVPDKG QDRIFIFAFE HGRLAPAPQP WLDCREGSGP RHMAFHPALA CAYVVNELDN
TVLTCRFDAA TGALQGLQIL STLPERFVGN SRAAGIEVLR DGRQVLVSNR GADGIAVFDV
DPLTGLLHAS GGFASGGRTP RFFTSSPDGR LLYVLNEDSD SIVCHAPDDA WRPLASTHCA
SPVCMVFAR