Gene Daci_5092 details


Gene Information

Locus tag: Daci_5092
Symbol:
ID: 5750703
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Delftia acidovorans SPH-1
Kingdom: Bacteria
Replicon accession: NC_010002
Strand:
Start bp: 5645153
End bp: 5646283
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 67%
IMG OID: 641300216
Product: cupin 4 family protein
Protein accession: YP_001566106
Protein GI: 160900524
COG category: [S] Function unknown
COG ID: [COG2850] Uncharacterized conserved protein
TIGRFAM ID:
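
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and an unspliced CDS ending in a single stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5645153, 5646283)
print(length)                  # 1131
print(protein_length(length))  # 376
```

Both results agree with the Gene Length and Protein Length fields of this record.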


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.581531
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.0289513
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACACCA ACACACCGCT TGCCCTGCTG GGCGGGCTTA CCGCTTCCCA ATTCATGCGC 
CGCCACTGGC ACAAGAAGCC CTTGCTGGTG CGCCAGGCAA TCCCGGGCTT CAAGCCCCTG
ATTCCCCGCG CCAGGCTGCT GGCCATGGCA GGCGAGGACG GTGTGGAGTC GCGCCTGATC
CAGCAGCAGG ACGGTGGCCA ATGGAAGCTC AGCCACGGCC CGCTGTCGCG CCGCAGCCTG
CCCTCGCTGC AAAAGCCCGG ATGGACCGTG CTCGTGCAGG GCGTGGACCT GCACGACGAC
GGCGTGCACC AGCTGATGCA GCAGTTCCGC TTCGTGCCCG AGGCGCGGCT GGACGATCTG
ATGATCAGCT TTGCCACCGA CCAGGGCGGC GTGGGCCCGC ATTTCGACAG CTACGACGTC
TTCCTGCTGC AGGCGCATGG CCGCCGGCGC TGGCGCATCG GCCGCCAGAA GGACCTTTCG
CTGCAACCCG ATGTGCCGCT GAAGGTGCTC TCGAATTTCG AGCCCGAGGA GGAGTTCGTG
CTCGAGCCCG GTGACATGCT CTACCTGCCG CCCAAGTGGG CCCATGACGG CATCGCCGAG
GGCGAGTGCA TGACCTACTC CATCGGCTTT CGCTCGCCCG CGCGCGACGA ACTGGCCCGC
GAGCTGCTGC TGCGCATGTC CGACGAGCCC GATGAACCCG AAGCGCCCAT GGTCTACCGC
GATCCCGACC AGCCCGCCGT CGAGGCTCCG GGCGAGATTC CGTCGAGCCT GCACGACTTC
GCGCGCAAGG CGCTGGAGCG CGCGCTGGCC GAGCCGCTGG CGCTGGAGCG CGCGCTGGGC
GAGTACATGA CCGAGCCCAA GGCCAATGTC TGGTTCGAGC ATGGCGAGGA GCACGGCATG
TTCGAGAGCG TGGTCCTCGA TCGCCGCACG CGCATGATGT ATGACGCAAA ACACATCTTC
ATCAACGGCG AAAGCTATCT GGCCGGTGGC CGCGATGCCA CCCTGATGCG CAAGCTGGCC
GATACGCGCG CCCTGTCACG CGCCGACCTG GCCAAGGCCA GCGATGACGC GCTGGAGCTG
CTGTCCTCCT GGTTTGACGC CGGCTGGGTG CGAGGCGGGC CGCTGTCCTG A
 
Protein sequence
MDTNTPLALL GGLTASQFMR RHWHKKPLLV RQAIPGFKPL IPRARLLAMA GEDGVESRLI 
QQQDGGQWKL SHGPLSRRSL PSLQKPGWTV LVQGVDLHDD GVHQLMQQFR FVPEARLDDL
MISFATDQGG VGPHFDSYDV FLLQAHGRRR WRIGRQKDLS LQPDVPLKVL SNFEPEEEFV
LEPGDMLYLP PKWAHDGIAE GECMTYSIGF RSPARDELAR ELLLRMSDEP DEPEAPMVYR
DPDQPAVEAP GEIPSSLHDF ARKALERALA EPLALERALG EYMTEPKANV WFEHGEEHGM
FESVVLDRRT RMMYDAKHIF INGESYLAGG RDATLMRKLA DTRALSRADL AKASDDALEL
LSSWFDAGWV RGGPLS
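
The sequence blocks above are grouped in runs of ten characters, so they need cleaning before any computation. The following is a minimal sketch, with hypothetical helper names, of how a block could be cleaned and then checked for length, GC fraction, and start and stop codons. The demo uses only the first and last lines of the gene sequence; pasting the full block should give a length equal to the Gene Length field and a GC fraction that can be compared with the GC content field.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: ATG start, standard stop codon, length divisible by 3."""
    # Assumption: only ATG is accepted here, although translation table 11
    # also permits alternative start codons such as GTG and TTG.
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in {"TAA", "TAG", "TGA"}
    )


# First and last lines of the gene sequence above, used as a short demo.
first_line = clean_sequence(
    "ATGGACACCA ACACACCGCT TGCCCTGCTG GGCGGGCTTA CCGCTTCCCA ATTCATGCGC"
)
last_line = clean_sequence(
    "CTGTCCTCCT GGTTTGACGC CGGCTGGGTG CGAGGCGGGC CGCTGTCCTG A"
)
print(len(first_line), len(last_line))                          # 60 51
print(first_line.startswith("ATG"), last_line.endswith("TGA"))  # True True
print(round(gc_fraction(first_line + last_line), 2))
```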