Gene Oant_4020 details

Gene Information

Locus tag: Oant_4020
Symbol:
ID: 5382166
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ochrobactrum anthropi ATCC 49188
Kingdom: Bacteria
Replicon accession: NC_009668
Strand:
Start bp: 1440780
End bp: 1441985
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 59%
IMG OID: 640836706
Product: beta-ketoadipyl CoA thiolase
Protein accession: YP_001372554
Protein GI: 153011340
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
            [TIGR02430] beta-ketoadipyl CoA thiolase
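
The coordinate and length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming 1-based inclusive coordinates and a single terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


# Values taken from the record above.
length = gene_length(1440780, 1441985)
print(length)                  # 1206
print(protein_length(length))  # 401
```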


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0676172
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAGAAG CTTTCATCTG CGACTACATC CGCACCCCCA TCGGACGTTT TGGCGGGGCG 
TTGTCCAGCA TCCGCACGGA CGATCTTGCC GCTATTCCGC TCAAAGCACT GGTCGAGCGC
AATCCGGCAC TGGATTGCGA GGCGATTGAG GACGTGATTT TCGGCTGCGC CAATCAGGCA
GGGGAAGATA ACCGCAATGT TGCGCGCATG GCGGCACTGC TTGCTGGTTA TCCCGTTACC
GCCACCGGTA CGACGATCAA CAGGCTTTGC GGTTCCGGGA TGGACGCCGT GCTTGCCGCC
GCGCGCGCCA TTCGTGCAGG CGAAGCAGAA CTTATGATTG CGGGTGGTGT GGAAAGCATG
AGCCGCGCGC CCTTTGTTTT ACCTAAGGCA GACAGCCCGT TCTCCCGCCG CGCAGAAATT
CACGATACGA CAATTGGCTG GCGTTTCGTC AATCCGGTGA TGAAAGCTCA ATACGGCATC
GACTCCATGC CGGAGACCGG GGATCACGTG GCTGTCGATT ACGCGGTCAC ACGCGCCGAT
CAGGATGAAT TCGCCTCGCG AAGCCAGAAG AAGGCTGCTG CAGCACAAAG CAACGGCCGA
CTGGCGCAAG AAATAACGCC CGTTTCCATT CCTCAGCGCA AGGGTGATCC CGTGGTCGTC
GGTGCTGACG AACATCCGCG CGAAACCACG CTGGACGCAC TGGCCAAACT GAAACCAATC
AACCGGCTGG AAGGTGCCAC GGTTACAGCT GGGAATGCGT CGGGCGTCAA CGATGGAGCG
GCAGCGCTCA TCATTGCTTC GGAAGCCGCT GCCCGCAAGT TCGGCCTCAC GCCTGTTGCT
CGAGTTCTGG GTGGTGCTAC GGCAGGAGTT CCTCCACGCA TCATGGGTAT CGGCCCAGCG
CCAGCCAGCC AGAAATTGAT GGCACGGCTG GGAATGAAGC AGGAGCAGTT CGATATAATC
GAACTGAATG AAGCCTTCGC CAGCCAGGGA TTGGCCACGT TACGTCTTCT TGGAATTGCC
GACGACGACA TCCGCGTCAA TCCGAATGGC GGTGCCATCG CTCTCGGACA TCCGCTCGGC
ATGTCCGGAG CCCGTATCAC CGGCACGGCA GCACTGGAAT TGAAACTTGG CGGCGGTCGC
TTTGCGCTCG CCACTATGTG CATCGGTGTG GGGCAAGGTA TCGCCATCGC GCTTGAAAGG
GTTTAA
 
Protein sequence
MTEAFICDYI RTPIGRFGGA LSSIRTDDLA AIPLKALVER NPALDCEAIE DVIFGCANQA 
GEDNRNVARM AALLAGYPVT ATGTTINRLC GSGMDAVLAA ARAIRAGEAE LMIAGGVESM
SRAPFVLPKA DSPFSRRAEI HDTTIGWRFV NPVMKAQYGI DSMPETGDHV AVDYAVTRAD
QDEFASRSQK KAAAAQSNGR LAQEITPVSI PQRKGDPVVV GADEHPRETT LDALAKLKPI
NRLEGATVTA GNASGVNDGA AALIIASEAA ARKFGLTPVA RVLGGATAGV PPRIMGIGPA
PASQKLMARL GMKQEQFDII ELNEAFASQG LATLRLLGIA DDDIRVNPNG GAIALGHPLG
MSGARITGTA ALELKLGGGR FALATMCIGV GQGIAIALER V
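
The gene and protein sequences above are related by translation table 11, and the GC content field is derived from the gene sequence. The sketch below shows one way to check both from the blocked text. It is an illustration, not the database's own method: `clean`, `translate`, and `gc_percent` are hypothetical helper names, and the code assumes the sequence is given 5' to 3' on the coding strand and starts with ATG (table 11 shares its amino acid assignments with the standard code and differs only in permitted start codons, which this sketch does not handle).

```python
from itertools import product

# Codon order follows the NCBI genetic-code listing: bases T, C, A, G,
# with the first base varying slowest and the third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGACAGAAG CTTTCATCTG CGACTACATC"))  # MTEAFICDYI
```

Passing the full gene sequence to `translate` and `gc_percent` allows comparison against the protein sequence and the GC content listed in the record.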