Gene Daud_1016 details


Gene Information

Locus tag: Daud_1016
Symbol:
ID: 6026017
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Desulforudis audaxviator MP104C
Kingdom: Bacteria
Replicon accession: NC_010424
Strand:
Start bp: 1067535
End bp: 1068875
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 63%
IMG OID: 641593828
Product: acetyl-CoA carboxylase, biotin carboxylase
Protein accession: YP_001717160
Protein GI: 169831178
COG category: [I] Lipid transport and metabolism
COG ID: [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit
TIGRFAM ID: [TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit
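
The length fields above can be cross-checked against the coordinates. The sketch below assumes that the start and end positions are inclusive and 1-based, and that the CDS ends in a single stop codon; neither assumption is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1067535
end_bp = 1068875


def gene_length(start, end):
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(start_bp, end_bp)
print(length, protein_length(length))  # prints: 1341 446
```

Under those assumptions the computed values agree with the Gene Length (1341 bp) and Protein Length (446 aa) fields.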


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000161184
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGAAGG TATTGATTGC GAATCGGGGA GAGATCGCGC TCCGGATTAT TCGTGCCTGC 
CGCGAACTGG GCTTAAAGAC GGTCGCCGTC TATTCGGAGG CCGACCGTGA CAGCCTGCCA
GTTCGTCTGG CGGACCAAGC CTACTGCATC GGTCCGGCGG ACGTGACCCG CAGTTACCTG
AACATCGCCG CCATCATCAG CGCCGCCGAA CTATCGGGGG CCGACGCCAT CCACCCGGGC
TACGGTTTTC TGGCCGAAAA CGCCCACTTC GCCGAAGTCT GCGAAAGCTG CGGCCTCACT
TTTGTCGGGC CGCCTGTTGA GGCTATCGCC AAAATGGGGG CCAAAGCGGA AGCACTGGAA
CTGGTTAGGA AAATGGGCGT GCCCATCGTG CCGGGCTCGG GAGGGGCCGT AAACGGATTA
GACCACGCCC TCGCGGTGGC GGAGGAAATC GGGTATCCCG TCCTGATCAA GGCTTCCGCC
GGTGGTGGCG GACGGGGGAT GCGGGTGGCC CACAACAAGT CCGACCTGTC CCGGGCCCTG
CAGGCGGCCC AGAGCGAAGC CCAGAACGCG TTCGGCAGTG GCGACGTCTA CCTGGAAAAG
TATATCGAGG AGCCGCGTCA CATCGAGATT CAGGTTTTCG GGGACCGGCA CGGCAACATT
CTATCCCTCG GGGAGCGGGA CTGTTCCATC CAACGGCGGA ACCAGAAACT GCTCGAAGAG
GCGCCGTCCA CCGCACTCAC TCCGGAGCTG CGCCGGCAGA TGGGGGATGC GGCCGTGCGG
GCGGCGCAGT CTGTGGGCTA CTACAACTCC GGTACGGTGG AATTTCTGCT AGACCGCGAG
AACCGGTTTT ACTTCATCGA AATGAACACC CGGATTCAGG TCGAACACCC GGTAACGGAA
ATGGTGACCG GCCTGGACCT GGTGAAGGAA CAGATCCGGG TGGCCGCCGG AGAAAAGCTG
TCGTTCGGTA AGGACGACGT GAAGATTAAC GGCTGGGCGA TCGAGTGCCG GATCAACGCC
GAGGACCCCT GGGCGAACTT CGGTCCGCGG GCGGGCCGGA TTACGGCTTA CCTCCCGGCG
GGCGGGCCGG GGATACGGGT GGACAGCGCG GTCTATCCCG GTTGCGTAGT GCCCCCTTAC
TACGACTCCC TGTTGGCCAA GCTGGTGGCC TGGGGCCGGG ACCGCGACGA AGCCATCGAC
CGGGCCGAGC GGGCACTCGC GGAATTCGTG GTTGAGGGAG TGCCGACCAC GATACCTTTC
CACCAGCGGG TGTTGTCCAA CGCCTTTTTC CGGAGGGGCG AGGTCTACAC CAATTTCGTC
CAGCGGCGCA TATTCCCCTG A
 
Protein sequence
MEKVLIANRG EIALRIIRAC RELGLKTVAV YSEADRDSLP VRLADQAYCI GPADVTRSYL 
NIAAIISAAE LSGADAIHPG YGFLAENAHF AEVCESCGLT FVGPPVEAIA KMGAKAEALE
LVRKMGVPIV PGSGGAVNGL DHALAVAEEI GYPVLIKASA GGGGRGMRVA HNKSDLSRAL
QAAQSEAQNA FGSGDVYLEK YIEEPRHIEI QVFGDRHGNI LSLGERDCSI QRRNQKLLEE
APSTALTPEL RRQMGDAAVR AAQSVGYYNS GTVEFLLDRE NRFYFIEMNT RIQVEHPVTE
MVTGLDLVKE QIRVAAGEKL SFGKDDVKIN GWAIECRINA EDPWANFGPR AGRITAYLPA
GGPGIRVDSA VYPGCVVPPY YDSLLAKLVA WGRDRDEAID RAERALAEFV VEGVPTTIPF
HQRVLSNAFF RRGEVYTNFV QRRIFP
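
The gene and protein sequences above are displayed in groups of ten characters across several lines, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup, together with a simple GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool.

```python
def clean_sequence(block):
    """Join a grouped, multi-line sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGAGAAGG TATTGATTGC GAATCGGGGA GAGATCGCGC TCCGGATTAT TCGTGCCTGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence block to `clean_sequence` and then to `gc_fraction` gives a value that can be compared with the GC content field of the record.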