Gene ECD_03475 details


Gene Information

Locus tag: ECD_03475
Symbol: kbl
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 3660556
End bp: 3661752
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: 2-amino-3-ketobutyrate coenzyme A ligase
Protein accession: ACT45274
Protein GI: 253979604
COG category:
COG ID:
TIGRFAM ID:
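
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Protein length for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(3660556, 3661752)
print(length, protein_length_aa(length))
```

Under these assumptions the result agrees with the listed Gene Length (1197 bp) and Protein Length (398 aa).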


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0123868
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTGGAG AATTTTATCA GCAGTTAACC AACGATCTGG AAACCGCACG GGCGGAAGGG 
TTGTTTAAAG AAGAGCGCAT TATTACGTCT GCGCAGCAAG CAGATATCAC TGTGGCTGAT
GGAAGCCACG TCATTAACTT TTGTGCCAAC AACTATCTCG GGCTGGCGAA TCATCCGGAT
CTGATTGCGG CAGCAAAGGC GGGAATGGAT TCTCACGGTT TCGGCATGGC TTCGGTGCGT
TTTATTTGCG GCACTCAGGA CAGCCATAAA GAGCTTGAAC AAAAACTGGC GGCATTCCTG
GGGATGGAAG ATGCGATTCT CTACTCTTCC TGCTTTGATG CTAACGGTGG ACTGTTTGAA
ACGCTGCTGG GTGCGGAAGA CGCCATTATC TCCGACGCAC TGAACCACGC GTCTATTATT
GATGGTGTGC GTCTGTGCAA AGCTAAACGC TATCGCTATG CCAACAACGA TATGCAGGAG
CTGGAAGCAC GTCTGAAAGA AGCGCGTGAA GCCGGTGCGC GTCATGTGCT GATTGCCACC
GATGGTGTGT TCTCAATGGA CGGCGTGATT GCCAATCTGA AGGGCGTTTG CGATCTGGCA
GATAAATACG ATGCCCTGGT GATGGTAGAC GACTCCCACG CGGTCGGTTT TGTCGGTGAA
AATGGTCGTG GTTCCCATGA ATACTGCGAT GTGATGGGGC GTGTCGACAT CATCACCGGC
ACGCTCGGTA AAGCGCTGGG CGGGGCTTCT GGTGGTTATA CCGCGGCGCG CAAAGAAGTG
GTTGAGTGGC TGCGCCAGCG TTCTCGTCCG TACCTGTTCT CCAACTCGCT GGCACCAGCG
ATTGTCGCCG CTTCCATCAA AGTGCTGGAG ATGGTTGAAG CGGGTAGCGA GTTGCGTGAC
CGTCTGTGGG CGAACGCGCG TCAGTTCCGT GGGCAAATGT CGGCGGCGGG CTTTACCCTG
GCGGGAGCCG ATCACGCCAT TATTCCGGTC ATGCTTGGTG ATGCGGTAGT GGCGCAGAAA
TTTGCCCGTG AGCTGCAAAA AGAGGGCATT TACGTCACCG GTTTCTTCTA TCCGGTCGTT
CCGAAAGGTC AGGCGCGTAT TCGTACCCAG ATGTCGGCGG CGCATACCCC TGAGCAAATT
ACGCGTGCAG TAGAAGCATT CACGCGTATT GGTAAACAAC TGGGCGTTAT CGCCTGA
 
Protein sequence
MRGEFYQQLT NDLETARAEG LFKEERIITS AQQADITVAD GSHVINFCAN NYLGLANHPD 
LIAAAKAGMD SHGFGMASVR FICGTQDSHK ELEQKLAAFL GMEDAILYSS CFDANGGLFE
TLLGAEDAII SDALNHASII DGVRLCKAKR YRYANNDMQE LEARLKEARE AGARHVLIAT
DGVFSMDGVI ANLKGVCDLA DKYDALVMVD DSHAVGFVGE NGRGSHEYCD VMGRVDIITG
TLGKALGGAS GGYTAARKEV VEWLRQRSRP YLFSNSLAPA IVAASIKVLE MVEAGSELRD
RLWANARQFR GQMSAAGFTL AGADHAIIPV MLGDAVVAQK FARELQKEGI YVTGFFYPVV
PKGQARIRTQ MSAAHTPEQI TRAVEAFTRI GKQLGVIA
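
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any downstream use. The sketch below shows one way to do that and to run simple sanity checks (length divisible by three, terminal stop codon, GC percentage). All function names are hypothetical. The start codon is deliberately not checked, on the assumption that a bacterial CDS may begin with a codon other than ATG; the stop codons used are TAA, TAG and TGA.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text):
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def looks_like_cds(seq):
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) >= 6 and len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


def gc_percent(seq):
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy input in the same block layout as the record (not the real gene).
toy = clean_sequence("ATGCGT GGA\nTGA ")
print(toy, looks_like_cds(toy), gc_percent(toy))
```

To apply this to the record, paste the lines under "Gene sequence" into `clean_sequence` and compare the resulting length and GC percentage with the Gene Length and GC content fields.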