Gene EcSMS35_3954 details

Gene Information

Locus tag: EcSMS35_3954
Symbol: kbl
ID: 6143133
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4032572
End bp: 4033768
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 54%
IMG OID: 641618780
Product: 2-amino-3-ketobutyrate coenzyme A ligase
Protein accession: YP_001745919
Protein GI: 170682117
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0156] 7-keto-8-aminopelargonate synthetase and related enzymes
TIGRFAM ID: [TIGR00858] 8-amino-7-oxononanoate synthase
            [TIGR01822] 2-amino-3-ketobutyrate coenzyme A ligase
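
The coordinates and lengths listed above are consistent with one another, and the arithmetic can be sketched as follows. This is only an illustration: the function name `cds_lengths` is hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that does not contribute an amino acid.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the record above.
gene_len, protein_len = cds_lengths(4032572, 4033768)
print(gene_len, "bp;", protein_len, "aa")
```

With the record's coordinates this reproduces the listed 1197 bp and 398 aa.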


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0183173
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGGAG AATTTTATCA GCAGTTAACC AACGATCTGG AAACCGCACG GGCGGAAGGG 
TTGTTTAAAG AAGAGCGCAT TATTACGTCT GCGCAGCAAG CAGATATCAC TGTGGCTGAT
GGAAGCCACG TCATTAACTT TTGTGCCAAT AACTATCTCG GACTGGCGAA TCATCCGGAT
CTGATTGCGG CGGCAAAGGC GGGAATGGAT TCTCACGGTT TCGGCATGGC TTCGGTGCGT
TTTATTTGCG GCACTCAGGA CAGTCATAAA GAGCTTGAAC AAAAACTGGC GGCCTTCCTG
GGGATGGAAG ATGCGATTCT CTACTCTTCC TGCTTTGATG CTAACGGTGG CCTGTTCGAA
ACGCTGTTAG GCGCGGAAGA CGCCATTATC TCCGACGCAC TGAACCACGC GTCTATTATT
GATGGTGTGC GTCTGTGTAA AGCTAAACGC TATCGTTATG CCAACAACGA TATGCAGGAG
CTGGAAGCAC GTCTGAAAGA AGCGCGTGAA GCCGGTGCGC GTCATGTGCT GATTGCCACC
GATGGTGTGT TCTCGATGGA CGGCGTGATT GCCAACCTGA AAGGTGTTTG CGATCTGGCA
GATAAATACG ATGCCCTGGT GATGGTAGAC GACTCCCACG CGGTCGGTTT TGTCGGTGAA
AACGGTCGTG GTTCCCATGA ATACTGCGAA GTGATGGGCC GGGTCGATAT TATCACCGGC
ACGCTCGGTA AAGCGCTGGG CGGGGCTTCT GGTGGTTATA CCGCGGCGCG CAAAGAAGTG
GTTGAGTGGC TGCGCCAGCG TTCTCGTCCG TACCTGTTCT CCAACTCGCT GGCACCGGCC
ATTGTTGCCG CTTCCATCAA AGTGCTGGAG ATGGTTGAAG CGGGTAGCGA GCTGCGTGAC
CGTCTGTGGG CGAACGCGCG TCAGTTCCGT GAGCAAATGT CGGCGGCGGG CTTTACCCTG
GCGGGAGCCG ATCACGCCAT TATTCCGGTC ATGCTTGGCG ATGCGGTAGT AGCGCAGAAA
TTTGCTCGTG AGCTGCAAAA AGAGGGGATT TACGTCACCG GTTTCTTCTA TCCGGTCGTT
CCGAAAGGTC AGGCGCGTAT TCGTACCCAG ATGTCTGCGG CGCATACCCC TGAGCAAATT
ACGCGTGCAG TAGAAGCATT CACGCGTATT GGTAAACAAC TGGGCGTTAT CGCCTGA
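
The gene sequence above is printed in space-separated blocks of 10 bases, so it needs to be cleaned before any computation. The sketch below shows one way to strip the whitespace and compute a GC percentage that can be compared with the GC content listed in the record. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the sketch assumes the sequence contains only A, C, G and T.

```python
def clean_sequence(raw):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw):
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above; paste the full sequence
# to compute the value for the whole gene.
first_line = ("ATGCGTGGAG AATTTTATCA GCAGTTAACC "
              "AACGATCTGG AAACCGCACG GGCGGAAGGG")
print(len(clean_sequence(first_line)), "bases")
print(round(gc_percent(first_line), 1), "% GC in the first line")
```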
 
Protein sequence
MRGEFYQQLT NDLETARAEG LFKEERIITS AQQADITVAD GSHVINFCAN NYLGLANHPD 
LIAAAKAGMD SHGFGMASVR FICGTQDSHK ELEQKLAAFL GMEDAILYSS CFDANGGLFE
TLLGAEDAII SDALNHASII DGVRLCKAKR YRYANNDMQE LEARLKEARE AGARHVLIAT
DGVFSMDGVI ANLKGVCDLA DKYDALVMVD DSHAVGFVGE NGRGSHEYCE VMGRVDIITG
TLGKALGGAS GGYTAARKEV VEWLRQRSRP YLFSNSLAPA IVAASIKVLE MVEAGSELRD
RLWANARQFR EQMSAAGFTL AGADHAIIPV MLGDAVVAQK FARELQKEGI YVTGFFYPVV
PKGQARIRTQ MSAAHTPEQI TRAVEAFTRI GKQLGVIA
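
The protein sequence can be related back to the gene sequence by translating codons with translation table 11, as listed in the record. The sketch below builds the codon table by hand rather than using a library; the names `CODON_TABLE` and `translate` are hypothetical. It relies on the fact that the amino-acid assignments of table 11 are the same as those of the standard code (the two tables differ in which codons may act as start codons), and it assumes the input starts in frame.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(raw_dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(raw_dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and the last line of the gene sequence above.
gene_start = "ATGCGTGGAG AATTTTATCA GCAGTTAACC"
gene_end = "ACGCGTGCAG TAGAAGCATT CACGCGTATT GGTAAACAAC TGGGCGTTAT CGCCTGA"
print(translate(gene_start))  # MRGEFYQQLT
print(translate(gene_end))    # TRAVEAFTRIGKQLGVIA
```

Both fragments match the corresponding ends of the protein sequence, and the final codon TGA is a stop codon, which is why the 1197 bp gene yields 398 amino acids.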