Gene Daro_3814 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3814 
Symbol 
ID3567970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4098547 
End bp4099941 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content63% 
IMG OID637682288 
Productaromatic hydrocarbon degradation protein 
Protein accessionYP_287012 
Protein GI71909425 
COG category[I] Lipid transport and metabolism 
COG ID[COG2067] Long-chain fatty acid transport protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value0.0090168 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0676668 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACAA GAAAAATCAT GGCCTCGGTG CTGGTTTCGG GTCTGGCCGG CTTTAGCAGC 
CTGGCGCAGG CGACTGATGT GTTCCGCCTG GAAGGTTACG GCGCGGTATC GCGCGGTATG
GGTGGTACGG CGGTCGCCCA CGATGTCGGC CCGGCCGGCA TGATGACCAA TCCGGCCACG
CTGTCGCTGA TGCAGGAGGG CGACCAGGTC ATGGGCGGGC TGGATCTGGT GACGACCGAC
ATCGAGGTCA GGAACAAGAA CACCGGCGAG CGCGTTTCGT CGGGCGAACA CGCCAGTAAC
CGGGGGCCGT ATGCTGCGCC GGAGCTGGCC TATACCAAAC GCTTCGGCGA CTGGGCGGTG
GGCGTCGGGG CTTTTGCCCA GGGCGGCCTC GGCACCGAAT ACGGTACCGG TAGCTTCCTG
TCGAGAGCGG TCGGTGGACT CAACACCGGC CTCGATAATT CCAGCCGCCT GCTGGTCCTG
AACATTCCCT TTGCCGCGAG TTTCAAAGTC AGCGAAAAAC TGGCCGTCGG CGGCAGCTTC
GATGCGATGT GGCAGGGCCT GAACCTGAAC CTGCTGCTCG GCGCCGATCA GGTCGGCAGC
CTGCTGAGCT CCGGCCGGGC GACCGGTACC CTGGTGCCGG TTCTGGGTGG CCTGCCTGAT
CTGCGCGGCG CTCACTTCAG CCTGACCAAG AACCAGCCGC TGGGCAGCGG CGTCGATGCC
TGGGGCTACA GCGGCAAGCT GGGCATGATC TACAAGGCAA CCCCGGAGAC GACACTGGGC
GCCTCCTATA CCTTCAAGAG CCAGATGGAC GACATGGAGG GTGGTGCCAC CCTGACGGCG
GTGGACGGCA TTGCCGGCCA GATTCCGCTG AAGGGCAAGA TCAAGATCCA GGATTTCCAG
ATGCCGGCCC ATCTCGATCT CGGCTTCAAC CAGCGCCTGT CGGCACAATG GACGGTCGCG
GTCGATGTCT CGCAGGTCTT CTGGAAGGAT GTGATGAAGG ACATCAAGGT GGCCTTCGTG
GCTGACCCGA GCGCGGCCGT GCCGACCGGC GGCACGCTCA ATATCCTGCT GCCGCAGGAC
TACAAGGATC AGACCATCCT GTCCCTGGGC ACGGCCTACG ACCTGAGCGA TCAGCTGACG
CTGCGCGGCG GTCTGCGTTT CGCGACCCAG GCCTTGCGCT CCTCGACGCT GTTTGCGGTG
ATCCCGGCCA CGCCGAGAAC GCATTTGTCG GCCGGCCTGA CCTATGCCCT GTCGAAGCAG
AGCAAGATCG ATTTCGCCTA CTCCCACGCC CTCAAGGAAA CGATGGATAA CAGCAGCCTG
CCGAACACCT CCGATCCGAT TCAGGTCAAG CACGCACAGA ACAACGCGAC CATCAATTTC
CGCTATAACT TTTGA
 
Protein sequence
MTTRKIMASV LVSGLAGFSS LAQATDVFRL EGYGAVSRGM GGTAVAHDVG PAGMMTNPAT 
LSLMQEGDQV MGGLDLVTTD IEVRNKNTGE RVSSGEHASN RGPYAAPELA YTKRFGDWAV
GVGAFAQGGL GTEYGTGSFL SRAVGGLNTG LDNSSRLLVL NIPFAASFKV SEKLAVGGSF
DAMWQGLNLN LLLGADQVGS LLSSGRATGT LVPVLGGLPD LRGAHFSLTK NQPLGSGVDA
WGYSGKLGMI YKATPETTLG ASYTFKSQMD DMEGGATLTA VDGIAGQIPL KGKIKIQDFQ
MPAHLDLGFN QRLSAQWTVA VDVSQVFWKD VMKDIKVAFV ADPSAAVPTG GTLNILLPQD
YKDQTILSLG TAYDLSDQLT LRGGLRFATQ ALRSSTLFAV IPATPRTHLS AGLTYALSKQ
SKIDFAYSHA LKETMDNSSL PNTSDPIQVK HAQNNATINF RYNF