Gene Daro_3201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3201 
Symbol 
ID3566871 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3450422 
End bp3451837 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content59% 
IMG OID637681672 
ProductType I secretion membrane fusion protein, HlyD 
Protein accessionYP_286401 
Protein GI71908814 
COG category[V] Defense mechanisms 
COG ID[COG1566] Multidrug resistance efflux pump 
TIGRFAM ID[TIGR01843] type I secretion membrane fusion protein, HlyD family 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value0.195077 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCGCC TTAAAACAGC GCTTCATGCC ACCCATGCCT GGTTGGAAAG TGTGGCCCTG 
CGCGCGGCGC CGCATGTCGA AAAGGTTCTC GGGAGATTGC CGAACCGGGA AGAGGTCGAA
GTGGTTGATT TCGCGACGGA TGCCGATCTG GCGATGCTGC GTCAGGAGCC GTTGCGCGCA
CGCGTCCTTT TGCGCTCGAT CAGCGTCGTA TTCATCATGT TTGTGCTTTG GGCCGCCATT
GCCCAGCTCG ACGAGGTGAC CCGGGGTGAA GGCCGGGTGA TTCCGTCGCG CCAGATCCAG
ATTCTTCAGA GCATCGACGG CGGGCTGGTT TCAGAAATTC TGGTCAAGGA AGGCGAGGTC
GTTCAGCCCA ACCAGTTGCT GATCAAGATC GATGAAACGC GTTTCGTCTC CTCGGTGAAG
GAAAATCAGG CTCAGTATCT GGGTTTGGTT GCCAAAGCGG CTCGCCTGCG GGCGATTTCT
GAAGGCAAGC CGTTTGTGCC GCCGCCGGAA GTGCTCAAGG CTGATCCTTC GATCGTCCAG
CAGGAACGTC AGTTTTACGA GGCGAGAAAC GACGAGCTGA ATGCCACGGT GTCCATAGCC
CGCCAGCAAC TCGCCCAGCG CCAGCAGGAA CTCAACGAAG CCCAGGCCAA GAGATCGCAG
GCTTCACAAG GCTATGACCT GACCTCCAGG GAACTGGCGG TGACCAAGCC CTTGATCAAC
TCCGGAGCCG TCTCCGAGGT CGAGTTGCTG CGTCTTGAGC GCGACGTTTC GCGCTACCGC
GGTGAGCGTG ACATGGCCTC GGCCCAGATT TCCAGGGTTC AGGCCTCGAT TCATGAAGCC
CAGCGCAAGA TCGAGGAGGT CGAGCTGACC TTCCGCAACG ATGCCAGCAA GGAACTTTCC
GAAACAATGG CCAAGCTGAA CAGCCTGGCA GAAGGCAGCG TCGCGCTGTC AGATCGGGTC
AAGCAGTCGT CGATTCGTTC GCCGGTCAAG GGCACGGTCA AGCGCCTGCT GGTCAATACC
GTCGGTGGCG TCGTTCAGCC GGGCAAGGAC ATGATCGAAA TCGTTCCATT GGAAGACACG
CTATTGCTGG AAGCCAGGGT GTTGCCGCGC GATATCGCCT TCCTGCGGCC CGGGCAGCCG
GCCATGGTCA AGTTCACAGC CTACGATTTC TCCATTTATG GCGGGCTGGA TGGAACGCTC
GAACACATCG GCGCCGACAG CGTCATTGAC GAAAAGGGCA ATGCCTTCTA TACCGTGCGG
GTTCGGACCA ACAAACCCGG TTTTGGCAAT GCCAACCTGC CGATTATTCC TGGCATGGTG
GCAGAGGTCG ATATTCTGAC GGGCAAGAAG AGTGTCCTGG CCTATCTGAT CAAGCCAGTC
CTCAGGGCCA AGAGCGTGGC TTTGACGGAA CGCTGA
 
Protein sequence
MSRLKTALHA THAWLESVAL RAAPHVEKVL GRLPNREEVE VVDFATDADL AMLRQEPLRA 
RVLLRSISVV FIMFVLWAAI AQLDEVTRGE GRVIPSRQIQ ILQSIDGGLV SEILVKEGEV
VQPNQLLIKI DETRFVSSVK ENQAQYLGLV AKAARLRAIS EGKPFVPPPE VLKADPSIVQ
QERQFYEARN DELNATVSIA RQQLAQRQQE LNEAQAKRSQ ASQGYDLTSR ELAVTKPLIN
SGAVSEVELL RLERDVSRYR GERDMASAQI SRVQASIHEA QRKIEEVELT FRNDASKELS
ETMAKLNSLA EGSVALSDRV KQSSIRSPVK GTVKRLLVNT VGGVVQPGKD MIEIVPLEDT
LLLEARVLPR DIAFLRPGQP AMVKFTAYDF SIYGGLDGTL EHIGADSVID EKGNAFYTVR
VRTNKPGFGN ANLPIIPGMV AEVDILTGKK SVLAYLIKPV LRAKSVALTE R