Gene ECD_00686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_00686 
SymbolsucB 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp720092 
End bp721309 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content56% 
IMG OID 
Productdihydrolipoamide acetyltransferase 
Protein accessionACT42561 
Protein GI253976891 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAGCG TAGATATTCT GGTCCCTGAC CTGCCTGAAT CCGTAGCCGA TGCCACCGTC 
GCAACCTGGC ATAAAAAACC CGGCGACGCA GTCGTACGTG ATGAAGTGCT GGTAGAAATC
GAAACTGACA AAGTGGTACT GGAAGTACCG GCATCAGCAG ACGGCATTCT GGATGCGGTT
CTGGAAGATG AAGGTACAAC CGTAACGTCT CGTCAGATCC TTGGTCGCCT GCGTGAAGGC
AACAGCGCCG GTAAAGAAAC CAGCGCCAAA TCTGAAGAGA AAGCGTCCAC TCCGGCGCAA
CGCCAGCAGG CGTCTCTGGA AGAGCAAAAC AACGATGCGT TAAGCCCGGC GATCCGTCGC
CTGCTGGCTG AACACAATCT CGACGCCAGC GCCATTAAAG GCACCGGTGT GGGTGGTCGT
CTGACTCGTG AAGATGTGGA AAAACATCTG GCGAAAGCCC CGGCGAAAGA GTCTGCTCCG
GCAGCGGCTG CTCCGGCGGC GCAACCGGCT CTGGCTGCAC GTAGTGAAAA ACGTGTCCCG
ATGACTCGCC TGCGTAAGCG TGTGGCAGAG CGTCTGCTGG AAGCGAAAAA CTCCACCGCC
ATGCTGACCA CGTTCAACGA AGTCAACATG AAGCCGATTA TGGATCTGCG TAAGCAGTAC
GGTGAAGCGT TTGAAAAACG CCACGGCATC CGTCTGGGCT TTATGTCCTT CTACGTGAAA
GCGGTGGTTG AAGCCCTGAA ACGTTACCCG GAAGTGAACG CGTCTATCGA CGGCGATGAC
GTGGTTTACC ACAACTATTT CGACGTCAGC ATGGCGGTTT CTACGCCGCG CGGCCTGGTG
ACACCGGTAC TGCGTGATGT CGATACCCTC GGCATGGCAG ACATCGAGAA GAAAATCAAA
GAGCTGGCAG TCAAAGGCCG TGACGGCAAG CTGACCGTTG AAGATCTGAC CGGTGGTAAC
TTCACCATCA CCAACGGTGG TGTGTTCGGT TCCCTGATGT CTACGCCGAT CATCAACCCG
CCGCAGAGCG CAATTCTGGG TATGCACGCT ATCAAAGATC GTCCGATGGC GGTGAATGGT
CAGGTTGAGA TCCTGCCGAT GATGTACCTG GCGCTGTCCT ACGATCACCG TCTGATCGAT
GGTCGCGAAT CCGTGGGCTT CCTGGTAACG ATCAAAGAGT TGCTGGAAGA TCCGACGCGT
CTGCTGCTGG ACGTGTAG
 
Protein sequence
MSSVDILVPD LPESVADATV ATWHKKPGDA VVRDEVLVEI ETDKVVLEVP ASADGILDAV 
LEDEGTTVTS RQILGRLREG NSAGKETSAK SEEKASTPAQ RQQASLEEQN NDALSPAIRR
LLAEHNLDAS AIKGTGVGGR LTREDVEKHL AKAPAKESAP AAAAPAAQPA LAARSEKRVP
MTRLRKRVAE RLLEAKNSTA MLTTFNEVNM KPIMDLRKQY GEAFEKRHGI RLGFMSFYVK
AVVEALKRYP EVNASIDGDD VVYHNYFDVS MAVSTPRGLV TPVLRDVDTL GMADIEKKIK
ELAVKGRDGK LTVEDLTGGN FTITNGGVFG SLMSTPIINP PQSAILGMHA IKDRPMAVNG
QVEILPMMYL ALSYDHRLID GRESVGFLVT IKELLEDPTR LLLDV