Gene Daci_5086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaci_5086 
Symbol 
ID5750697 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDelftia acidovorans SPH-1 
KingdomBacteria 
Replicon accessionNC_010002 
Strand
Start bp5640154 
End bp5641380 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content68% 
IMG OID641300210 
Productmajor facilitator transporter 
Protein accessionYP_001566100 
Protein GI160900518 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.90358 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0097128 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCACAAAA ACAACCAACC CATGTCCATG GTGCAGATAC TGCTGTGCGG GGGCGCAGTC 
GTGACGCTGT CCATGGGAAT CCGCCACGGC TTCGGCCTGT GGCTGCAGCC CATCACCCAG
GAGATGGGCT GGACGCGCGA GTCCTTCGCG CTGGCCATCG CCATCCAGAA CCTGTCCTGG
GGCGTGCTCG GCATCTTCGG CGGCATGCTG GCCGACCGCT TCGGCGCCTT CCGCGTGCTC
CTGGTGGGCG CGCTGCTGTA TGCGCTGGGC CTGGCCGGCA TGGCCATGGC GCCCACGACA
ACCTGGTTCG CCCTGACGGC CGGCGTGGTG ATCGGCGCAG CCCAGGCAGG CACCACCTAT
GCCGTGATCT ACGGTGTGCT GGGCCGCCAG ATTCCCGTGG CGCGGCGCAG CTGGGCCATG
GGCGTGACGG CGGCGGCGGG CTCCTTCGGC CAGTTCTTCA TGGTGCCCGT GGAAGGCAGC
CTGATTTCGA ACTTCGGCTG GTCCAACGCC CTGCTGCTGC TGTCGCTGTG CGCGCTGATG
ATCATTCCGC TGGCCTTCGG CCTGCGCGAG CCGGGCTTCC AGCCCGGCAA CGCCCGTCCC
GTGCGCGACC AGAGCGCGGG CCAGGCCGTC GCCGAAGCGC TGCGCACGCC CAGCTTCGTG
CTGCTGACGG CCGGCTACTT CGTCTGCGGC TTCCAGGTGA TGTTCATCGG CGTGCACATG
CCCAGCTACC TCAAGGACTA CGGCCTGGCA CCGCAGGTGG CCAGCATCTC GCTGGCGCTG
GTGGGGCTGT TCAACATCGT TGGCACCTAC GTGGCCGGCA ACCTGGGCCA GCGCCTGCCC
AAGCGCTACC TGCTGTCCAC CATCTACTTC ACGCGCTCGG TGGTGATCGT GCTCTTCCTG
CTGGCGCCGC TGACGCCGTG GTCGGTCTAC ATCTTCTCTG CCGCCATGGG CCTGCTGTGG
CTGTCCACCG TGCCGCTGAC CAACGCCACC GTGGCCCAGA TCTTCGGCGT GCAGCATCTG
TCCATGTTGA GCGGCATGGT GTTCTTCAGC CACCAGGTGG GCAGCTTCCT GGGCGTCTGG
CTGGGCGGCT ATCTCTATGA CCACACGGGC AGCTACCAGG TGGTCTGGTA CCTGGCCATC
GGCCTGGGCG TGGCCGCCGG CCTGCTGAAC CTGCCCATAC GCGAAGCCCC GGTGGCACGG
TTGCGCGCGG CCCAGGCCGC TGCCTGA
 
Protein sequence
MHKNNQPMSM VQILLCGGAV VTLSMGIRHG FGLWLQPITQ EMGWTRESFA LAIAIQNLSW 
GVLGIFGGML ADRFGAFRVL LVGALLYALG LAGMAMAPTT TWFALTAGVV IGAAQAGTTY
AVIYGVLGRQ IPVARRSWAM GVTAAAGSFG QFFMVPVEGS LISNFGWSNA LLLLSLCALM
IIPLAFGLRE PGFQPGNARP VRDQSAGQAV AEALRTPSFV LLTAGYFVCG FQVMFIGVHM
PSYLKDYGLA PQVASISLAL VGLFNIVGTY VAGNLGQRLP KRYLLSTIYF TRSVVIVLFL
LAPLTPWSVY IFSAAMGLLW LSTVPLTNAT VAQIFGVQHL SMLSGMVFFS HQVGSFLGVW
LGGYLYDHTG SYQVVWYLAI GLGVAAGLLN LPIREAPVAR LRAAQAAA