Gene Daro_3938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3938 
Symbol 
ID3567476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4233660 
End bp4235018 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content57% 
IMG OID637682412 
Productacetyl-CoA carboxylase biotin carboxylase subunit 
Protein accessionYP_287136 
Protein GI71909549 
COG category[I] Lipid transport and metabolism 
COG ID[COG0439] Biotin carboxylase 
TIGRFAM ID[TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.0000039131 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGAAA AAATCCTCAT TGCCAACCGG GGCGAAATCG CGCTGCGTAT CCAGCGTGCC 
TGCCGCGAGC TTGGCATCAA GACTGTGGTC GTGCACTCCG AGGCCGATCG TGATGCCAAG
TACGTCAAAC TGGCCGACGA GTCGGTCTGT ATCGGCCCGG CTTCTTCTGC CCTCAGCTAC
CTGAACGTTC CGGCGATCAT TTCGGCGGCA GAAGTCACCG ATGCCCAGGC GATTCACCCG
GGTTATGGCT TCCTGTCCGA GAATGCCGAC TTTGCCGAGC GCGTTGAAAC CTCCGGCTTC
GTCTTTATCG GCCCGAAGGC CGAAACCATT CGCCTGATGG GCGACAAGGT GTCGGCCAAG
GATGCGATGA AAGTGGCCGG TGTGCCCTGC GTTCCAGGTT CCGAGGGCGA GTTGCCGGAT
GACCCTAAGG AAATCGTCAA GATCGCCCGT GCGGTGGGTT ATCCAGTGAT CATCAAGGCC
GCCGGTGGTG GCGGTGGTCG CGGCATGCGC GTCGTTCATA CCGAGGCTGC CCTGGTCAAT
GCCGTGCAAA TGACCAAGCA GGAAGCCGGC AGCTTTTTCG GCAACCCTGC GGTCTATATG
GAAAAGTATC TGGAAAATCC GCGTCACGTG GAAATCCAGG TGCTGGCCGA CCAGCATGGC
AGCGCCATTT ATCTGGGCGA GCGCGATTGC TCCATGCAGC GTCGTCACCA GAAGGTGATT
GAAGAGGCAC CGGCACCGCA CATCGCGCCG CGCCTGATCA ACCGTATTGG CGAGCGCTGC
GCCGAAGCCT GTCGCAAGAT CGGTTACCGT GGTGCAGGTA CCTTCGAATT CCTTTACGAG
AACAACGAGT TCTATTTCAT CGAAATGAAC ACCCGTGTTC AGGTTGAGCA TCCGGTGACC
GAGATGATCA CTGGCGTCGA TATCGTTCAG GAACAGATTC GCGTCGCCTT TGGCGAAAAG
CTGCGCTACA AGCAGAAGGA CATTGTCTGC CGTGGCCATG CCATCGAGTG CCGTATTAAC
GCCGAAGATC CCTTCACGTT CGTGCCATCT CCCGGCAATA TCACGTTTTA TCACCCACCG
GGTGGCCCAG GTATCCGCGT TGATTCGCAC ATTTATCAGG GTTACAAGGT GCCGTCGCAT
TACGACTCGA TGGTTGCCAA GGTGATTTCC TATGGTGATA CTCGCGAGCA GGCCATTCGT
CGCATGCGCA TCGCCTTGTC CGAGATGAGC ATCCAGGGCA TCAAGACCAA CATCCCGTTG
CACCAGGAAC TGATGCAGGA CGCCCGTTTT GTTGAGGGCG GCACCAGCAT CCACTACCTT
GAACAGAAAC TCGCCGACAA GGGCGAAGTG AAGGCTTAG
 
Protein sequence
MFEKILIANR GEIALRIQRA CRELGIKTVV VHSEADRDAK YVKLADESVC IGPASSALSY 
LNVPAIISAA EVTDAQAIHP GYGFLSENAD FAERVETSGF VFIGPKAETI RLMGDKVSAK
DAMKVAGVPC VPGSEGELPD DPKEIVKIAR AVGYPVIIKA AGGGGGRGMR VVHTEAALVN
AVQMTKQEAG SFFGNPAVYM EKYLENPRHV EIQVLADQHG SAIYLGERDC SMQRRHQKVI
EEAPAPHIAP RLINRIGERC AEACRKIGYR GAGTFEFLYE NNEFYFIEMN TRVQVEHPVT
EMITGVDIVQ EQIRVAFGEK LRYKQKDIVC RGHAIECRIN AEDPFTFVPS PGNITFYHPP
GGPGIRVDSH IYQGYKVPSH YDSMVAKVIS YGDTREQAIR RMRIALSEMS IQGIKTNIPL
HQELMQDARF VEGGTSIHYL EQKLADKGEV KA