Gene Avin_38200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_38200 
SymbolpcaF 
ID7762711 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3865658 
End bp3866863 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content71% 
IMG OID643806685 
Productbeta-ketoadipyl CoA thiolase 
Protein accessionYP_002800937 
Protein GI226945864 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases
[TIGR02430] beta-ketoadipyl CoA thiolase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCGCG ACGTCTTCAT CTGCGATGCC GTGCGCACGC CCATCGGCCG CTTCGGCGGC 
GGCCTGGCCG GCGTGCGCGC CGACGACCTG GCCGCCATCC CGCTGAAGGC GCTGCTGGCG
CGCAACCCGC GACTCGATCC GGCCGCCGTC GATGAGGTGT TCCTGGGCTG CGCCAACCAG
GCCGGCGAGG ACAACCGCAA CGTGGCGCGC ATGGCGTCGC TGCTCGCCGG CCTGCCGGAG
ACGGTGCCGG GGGTGACACT CAACCGCCTG TGCGCCTCGG GCATGGATGC GATCGGCACC
GCCGCCCGCG CCATCGCCAG CGGCGAGATC GAGCTGGCCA TCGCCGGCGG CGTGGAGTCC
ATGTCGCGTG CGCCCTTCGT GATGGGCAAG GCCGACGCCG CCTTCTCGCG CAACATGAAG
ATCGAGGACA CCACCATCGG CTGGCGTTTC GTCAACCCGT TGATGAAGCA GCAGTACGGC
GTGGACTCCA TGCCGGAAAC CGCCGACAAC GTCGCCGACG ACTACCGGAT CGGCCGCGCC
GACCAGGACG CCTTCGCCCT GCGCAGCCAG CAGCGCGCGG CGGCGGCCAT GGAGTCCGGC
TACTTCGCCG AGGAGATCGT CCCGGTGGTC ATCAAGACCA GGAAGGGCGA GACGCTGATC
GACACGGACG AGCATCCGCG CCCGGACACC AGCGCCGAGG CGCTGGCCAG GCTCAAGCCG
GTCAACGGCG AGGGCAAGAC GGTTACCGCC GGCAACGCCT CGGGGGTCAA CGACGGCGCC
GCGGCGCTGA TCCTGGCCTC CGCCGAGGCG GTGCGCAAAT ACGGCCTGAA GGCCCGCGCC
CGGGTGCTCG GCATGGCCAG CGCCGGGGTC GCGCCGCGGA TCATGGGCTA CGGCCCGGTG
CCGGCGGTGC GCAAGCTCCT GCAGCGCTTG GAGCTGAGCA TCGACGCCTT CGATGTGATC
GAACTCAACG AGGCCTTCGC CAGCCAGGGC CTGGCGGTAT TGCGCGACCT GGACCTCGCC
GACGACGATG CGCGGGTCAA CCCCAACGGC GGCGCCATCG CCCTCGGCCA CCCGCTGGGC
ATGAGCGGCG CGCGCCTGGT GCTGACCGCC CTGCATCAAC TGGAGAAGTC CGGCGGCAGC
AAGGGCCTGG CGACCATGTG CATCGGCGTC GGCCAGGGGC TGGCGCTGGC CATCGAGCGC
GTCTGA
 
Protein sequence
MSRDVFICDA VRTPIGRFGG GLAGVRADDL AAIPLKALLA RNPRLDPAAV DEVFLGCANQ 
AGEDNRNVAR MASLLAGLPE TVPGVTLNRL CASGMDAIGT AARAIASGEI ELAIAGGVES
MSRAPFVMGK ADAAFSRNMK IEDTTIGWRF VNPLMKQQYG VDSMPETADN VADDYRIGRA
DQDAFALRSQ QRAAAAMESG YFAEEIVPVV IKTRKGETLI DTDEHPRPDT SAEALARLKP
VNGEGKTVTA GNASGVNDGA AALILASAEA VRKYGLKARA RVLGMASAGV APRIMGYGPV
PAVRKLLQRL ELSIDAFDVI ELNEAFASQG LAVLRDLDLA DDDARVNPNG GAIALGHPLG
MSGARLVLTA LHQLEKSGGS KGLATMCIGV GQGLALAIER V