Gene Vapar_2023 details

Gene Information

Locus tag: Vapar_2023
Symbol:
ID: 7969816
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Variovorax paradoxus S110
Kingdom: Bacteria
Replicon accession: NC_012791
Strand:
Start bp: 2166309
End bp: 2167487
Gene length: 1179 bp
Protein length: 392 aa
Translation table: 11
GC content: 69%
IMG OID: 644792620
Product: acetyl-CoA acetyltransferase
Protein accession: YP_002943934
Protein GI: 239815024
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
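
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that agrees with the listed gene length) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(2166309, 2167487)
print(length_bp)                  # 1179
print(protein_length(length_bp))  # 392
```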


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.525082
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAGACA TCGTCATCGT TTCGGCTGCA CGCACGGCGG TCGGCAAGTT CGGCGGCTCG 
CTCGCCGGCA TTGCAGCCAC CGAGCTGGGC GCCCTCGTGA TCAAGGAAGT GATCGCGCGC
GCCAATCTCA CGGCCGACCA GGTCGGCGAA GCCATCATGG GCCAGGTGCT GGCCGCCGGT
GCGGGGCAGA ACCCCGCGCG CCAGGCATGG CTCAAGAGCG GCGGCGCCAA GGAAACGCCG
GCGCTCACCA TCAACGCCGT GTGCGGCTCG GGCCTGAAGG CCGTGATGCT CGCGGCGCAG
GCCGTGGCCA CGGGCGACAG CGAGATCGTG ATTGCCGGCG GCCAGGAGAA CATGAGCGCC
GCGCCGCACG TGCTGCCCAA TTCGCGCAAC GGCCAGCGCA TGGGCGACTG GAAGCTGGTC
GACACGATGA TCGTGGATGG CCTGTGGGAC GTCTACAACC AGTACCACAT GGGCATCACG
GCCGAGAACG TGGCCAAGAA GTACGGCATC GACCGCTCCG CGCAGGACGA GCTCGCGCTC
GGCAGCCAGA CCAAGGCCGC GGCTGCGCAG GACGCCGGCA AGTTCAAGGA CGAGATCGTG
CCCGTGAGCA TTGCGCAGAA GAAGGGCGAC CCGATCGTCT TCGACAAGGA CGAGTTCATC
AACCGCAAGA CCAGCGCCGA AGGCCTGGCG GGCCTGCGCC CTGCGTTCGA CAAGGCCGGC
GGCGTGACCG CGGGCAATGC CTCGGGCCTG AACGACGGCG CCGCCGCCGT GATGGTGATG
ACGGCCAAGA AGGCGGCCGC GCTCGGCCTC AAGCCGCTGG GCCGCATCGC AAGCTACGCC
ACGGCCGGCC TCGATCCGGC GATCATGGGC ATGGGCCCGG TGCCGGCGTC GACCAAGGCG
CTGCAGCGCG CCGGCTGGAA GGCCGCCGAC CTCGACCTGC TCGAGATCAA CGAGGCCTTC
GCGGCGCAGG CCTGCGCGGT GAACAAGGAA ATGGGCTGGG ACGTGAACAA GGTCAACGTG
AACGGCGGCG CCATTGCCAT CGGCCACCCG ATCGGCGCGT CGGGCTGCCG CATCCTGGTG
ACGCTGCTGC ACGAGATGCA GCGCCAGAAC GCGAAGAAGG GCATTGCGTC GCTGTGCATC
GGCGGCGGCA TGGGCGTGGC GCTGACGATC GAGCGATAG
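
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any analysis. The following is a minimal sketch of that clean-up plus a GC-fraction calculation; the helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first row of the sequence is used as input here. Running it on the full sequence would be one way to compare against the GC content reported for this gene.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGGAAGACA TCGTCATCGT TTCGGCTGCA CGCACGGCGG TCGGCAAGTT CGGCGGCTCG"
seq = join_blocks(first_row)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 2))  # GC fraction of the first row only
```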
 
Protein sequence
MEDIVIVSAA RTAVGKFGGS LAGIAATELG ALVIKEVIAR ANLTADQVGE AIMGQVLAAG 
AGQNPARQAW LKSGGAKETP ALTINAVCGS GLKAVMLAAQ AVATGDSEIV IAGGQENMSA
APHVLPNSRN GQRMGDWKLV DTMIVDGLWD VYNQYHMGIT AENVAKKYGI DRSAQDELAL
GSQTKAAAAQ DAGKFKDEIV PVSIAQKKGD PIVFDKDEFI NRKTSAEGLA GLRPAFDKAG
GVTAGNASGL NDGAAAVMVM TAKKAAALGL KPLGRIASYA TAGLDPAIMG MGPVPASTKA
LQRAGWKAAD LDLLEINEAF AAQACAVNKE MGWDVNKVNV NGGAIAIGHP IGASGCRILV
TLLHEMQRQN AKKGIASLCI GGGMGVALTI ER
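
The protein sequence can be derived from the gene sequence by codon-by-codon translation. The sketch below builds a codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as starts, something this sketch does not model). The `translate` helper is illustrative; it reads in frame from the first base and stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGGAAGACA TCGTCATCGT"))  # MEDIVI (first six codons of the gene)
print(translate("GAGCGATAG"))              # ER (last two residues, then the stop)
```

Both outputs agree with the start (MEDIVI...) and end (...ER) of the protein sequence listed above.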