Gene Avin_16380 details


Gene Information

Locus tag: Avin_16380
Symbol:
ID: 7760573
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 1620013
End bp: 1621206
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 70%
IMG OID: 643804538
Product: acetoacetyl-CoA thiolase
Protein accession: YP_002798828
Protein GI: 226943755
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
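
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Protein length for a CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_length // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 1620013, 1621206
length = gene_length_bp(start_bp, end_bp)
protein = expected_protein_length_aa(length)
print(length, protein)
```

Under these assumptions the computed values agree with the reported Gene Length (1194 bp) and Protein Length (397 aa).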


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.506905
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGCAC AAATACAAGA AGTGGTCATC GTCAGCGGGG TTCGCACGGC CATCGGCGGC 
TTCGGCGGCA GCCTGAAGAG CCACCGGCCC GGCGAACTGG GCGGCCTGCT GGTCGCCGAG
GCGGTACGCC GGGCGGGCAT CGATGCGGCC GGCATCGGCC ATTGCGTGTT CGGCAACGTC
ATCCACAGCG AGCCGCGCGA CATGTACATC AGCCGGGTGG CGGCGCTGCA GGGCGGCCTG
TCCGTGGATA CCCCGGCGCT GACCGTCAAC CGCCTGTGCG GCAGCGGCCT GCAGGCCATC
GTCAGCGCCG CCCAGCAGAT CCAGCTCGGC CTGTGCGACG CGGCGGTGGC CGGCGGCGCC
GAGTCGATGA GCCTGGCGCC CTACCACCTG CCCGCCGGGC GCTTCGGCCA GCGGATGGGC
GACGGGGTGA CCGTCGACCC GATGGTCGGC GCCCTGCAGT GCCCGATCAA CCGCTACCAC
ATGGGCGTGA CCGCGGAGAA CGTCGCCGAG CAGTACGGCA TCACCCGCGA GCAGCAGGAT
GCGCTGGCCG TCGAGAGCCA CAAGCGCGCC CAGCGCGCCG TCGAGCAGGG GTATTTCAAG
GAGCAGATCC TGCCCATCGA GCTGAAGAGC CGCAAAGGCT CGACCTTCTT CGATACCGAC
GAGCACATCC GCTTCGACTG CACGCTCGGC GACCTCGAGC CGCTGAAGCC GGTGTTCAAG
AAGGAGGGCG GCACGGTCAC CGCCGGCAAC GCCTCCGGGC TCAACGACGG CGCCGCCGCC
CTGGTGCTGA TGGCGCGCAC CAGGGCCGAA GCCGAGGGTC GTCAGGTCCT GGCCCGATTG
GTCGATTATG CGGTGGTCGG CGTCGAGCCG AGCATCATGG GCATCGGCCC GGTTCCGGCC
ATTCGCCAGT TGCTCGAACG CAACGGCCTG GGCGTCGGCG ACATCGACGT CTTCGAGGTC
AACGAGGCCT TCGCCGCCCA GGCGCTGGCC GTCGCCCAGC AACTGGAACT GCCCGCCGAG
CGCCTCAACC CCAACGGCAG CGGCATCTCC ATGGGCCATC CGATCGGCGC CACCGGCGCC
ATCATCACGG TGAAGGCGAT CCACGAGCTG GCCCGCGTCC TGGGCCGCTA CGCCATCGTC
AGCCTGTGCA TCGGCGGCGG CCAGGGCATC GCGGCGCTGC TGCGGCGCGA CTGA
 
Protein sequence
MSAQIQEVVI VSGVRTAIGG FGGSLKSHRP GELGGLLVAE AVRRAGIDAA GIGHCVFGNV 
IHSEPRDMYI SRVAALQGGL SVDTPALTVN RLCGSGLQAI VSAAQQIQLG LCDAAVAGGA
ESMSLAPYHL PAGRFGQRMG DGVTVDPMVG ALQCPINRYH MGVTAENVAE QYGITREQQD
ALAVESHKRA QRAVEQGYFK EQILPIELKS RKGSTFFDTD EHIRFDCTLG DLEPLKPVFK
KEGGTVTAGN ASGLNDGAAA LVLMARTRAE AEGRQVLARL VDYAVVGVEP SIMGIGPVPA
IRQLLERNGL GVGDIDVFEV NEAFAAQALA VAQQLELPAE RLNPNGSGIS MGHPIGATGA
IITVKAIHEL ARVLGRYAIV SLCIGGGQGI AALLRRD
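
The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. It assumes GC content means the share of G and C bases over the full gene sequence; how the database actually computes or rounds the figure is not stated in the record. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAGCGCAC AAATACAAGA AGTGGTCATC GTCAGCGGGG TTCGCACGGC CATCGGCGGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

To compare against the reported value, pass the complete gene sequence through `clean_sequence` instead of the single sample line.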