Gene Daro_4003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_4003 
Symbol 
ID3567214 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4300611 
End bp4302614 
Gene Length2004 bp 
Protein Length667 aa 
Translation table11 
GC content59% 
IMG OID637682476 
ProductAcetyl-CoA carboxylase, biotin carboxylase 
Protein accessionYP_287200 
Protein GI71909613 
COG category[I] Lipid transport and metabolism 
COG ID[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value0.666441 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.37304 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCACTA AGATTCTGAT CGCAAACCGC GGCGAAATTG CCTGCCGCGT CATCAAGACC 
GCCCGCAAGA TGGGCATCAA GACGGTTGCC GTCTATTCCG AAGCCGACAA GGACGCCTTG
TTCGTCGATA TGGCCGATGA AGCCGTCTGT ATCGGCCCGG CTGCGTCGAA AGAGTCCTAT
CTGGTTGCCG ACAAGATCAT CGCCGCCTGC AAGCAGACCG GTGCTCAGGC CGTCCATCCG
GGCTACGGCT TTCTGTCGGA AAACGCAGGG TTCTCGCGTC GCCTCGAAGA AGAAGGCATC
AAGTTCATCG GTCCGAAGCA CTACTCCATC GCCAAGATGG GCGACAAGAT CGAGTCCAAG
AAGCTGGCCA TCGAAGCCAA GGTCAACACC ATCCCGGGCT ACAACGAGGC CATCGCGGGT
CCTGACGAAG CCGTCAAGAT CGCTCAGGGC ATCGGCTATC CGGTCATGAT CAAGGCTTCC
GCAGGCGGCG GCGGCAAGGG CCTGCGCGTG GCCTACAACG ATGCCGAAGC GCACGAAGGT
TTTTCCTCTT GCGTCAATGA AGCCCGAAAT TCCTTCGGTG ACGACCGCGT CTTCATCGAA
AAGTACGTGC TCGAACCGCG TCACATCGAA ATTCAGGTGC TCGGTGACTC GCACGGTAAC
TACGTGTACC TGAACGAGCG CGATTGCTCG ATCCAGCGTC GTCACCAGAA GGTCATCGAA
GAAGCGCCGA GCCCCTTCGT CGATCCCGAG ATGCGCAAGG CGATGGGCGA ACAGGCCGTA
GCCCTGGCCC GTGCTGTGAA TTACGAGTCG GCCGGTACGG TCGAGTTCGT GGTGTCCGGT
GCGACCAAGG AGTTCTACTT CCTGGAAATG AACACCCGCC TGCAGGTGGA ACACCCGGTC
ACCGAACTGA TTACCGGTCT CGACCTCGTC GAGCAGATGA TCCGTGTCGC CTACGGCGAA
AAGCTGCCGC TGACCCAGGC TGAAGTGAAG ATCGACGGCT GGGCGATGGA ATGCCGGATC
AACGCCGAAG ACCCGTTCCG CGGCTTCCTG CCCTCCACCG GCCGTCTGGT CAAGTTCCTG
CCCCCCAAGG AAGTGCCGGG TCACGTGCGC GTCGATACCG GCGTTTACGA CGGTGGCGAG
ATCTCGATGT TCTACGACTC GATGATTGCC AAGCTGATCG TCCATGGTGC CACCCGCGAG
CAGGCCATCG CCCGCATGCG CGATGCGCTG AACGGCTTCG TCATTCGCGG CATTTCCTCG
AACATCCCGT TCCAGGCCGC GCTGATGCAG CATCCGGTTT TCCATTCCGG CATCTTCGAT
ACCGGTTTCA TCCCCAAGCA CTACCCGACG GGCTTCGATG CCTCGATGGT GCCGCACGAT
GATCCGGCCC TGCTGGTTTC CGTTGCCGCC TACGTCTATC GTGCCTTCAC CGACCGTTCT
GCCTCGATCA CCGGTCAGTT GCAGGGTCAC GAGCGCCTGG TCAGTGACAA CTGGTGTGTC
GTTCGCCTGA ACCCGAATGG CAACGAGAAT CATATGGTTG TCGCCCGTCC GATTCCAGGT
GGCTACCATG TCGAGTACAA GGGTGAGCAG TACGAAATCC TGTCCGACTG GAAGCTGGGT
GAGTCGCTGT TCAACGGCAC CTGCAACGGT GAGGAATTCA CGCTGCAGGT CGAGCGTCAC
AAGACTCGCT ACAGCCTGTT CCACTGGGGC ACACGTGCCG ATTTCATGGT GATGAGCGCC
CGTGCAGCTG AACTACTGGC CTTGATGCCT GAAAAGCAGG CGCCTGACCT CACCAAGTTC
CTGATCTCCC CGATGCCTGG TCTGCTCCGC GAAGTGGCGG TCAAGGTTGG TCAGGACGTC
AAGGCCGGCG AAAAGTTGGC GGTCATCGAA GCCATGAAGA TGGAAAACAT CCTTAAGGCC
GACCAGGACT GCAAGGTCAA GAAGATCTCG GCAGCGGCCG GCGAAAGCCT GACCGTGGAT
CAGATCATCA TTGAATTCGA GTGA
 
Protein sequence
MFTKILIANR GEIACRVIKT ARKMGIKTVA VYSEADKDAL FVDMADEAVC IGPAASKESY 
LVADKIIAAC KQTGAQAVHP GYGFLSENAG FSRRLEEEGI KFIGPKHYSI AKMGDKIESK
KLAIEAKVNT IPGYNEAIAG PDEAVKIAQG IGYPVMIKAS AGGGGKGLRV AYNDAEAHEG
FSSCVNEARN SFGDDRVFIE KYVLEPRHIE IQVLGDSHGN YVYLNERDCS IQRRHQKVIE
EAPSPFVDPE MRKAMGEQAV ALARAVNYES AGTVEFVVSG ATKEFYFLEM NTRLQVEHPV
TELITGLDLV EQMIRVAYGE KLPLTQAEVK IDGWAMECRI NAEDPFRGFL PSTGRLVKFL
PPKEVPGHVR VDTGVYDGGE ISMFYDSMIA KLIVHGATRE QAIARMRDAL NGFVIRGISS
NIPFQAALMQ HPVFHSGIFD TGFIPKHYPT GFDASMVPHD DPALLVSVAA YVYRAFTDRS
ASITGQLQGH ERLVSDNWCV VRLNPNGNEN HMVVARPIPG GYHVEYKGEQ YEILSDWKLG
ESLFNGTCNG EEFTLQVERH KTRYSLFHWG TRADFMVMSA RAAELLALMP EKQAPDLTKF
LISPMPGLLR EVAVKVGQDV KAGEKLAVIE AMKMENILKA DQDCKVKKIS AAAGESLTVD
QIIIEFE