Gene ECD_00743 details


Gene Information

Locus tag: ECD_00743
Symbol: bioF
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 794374
End bp: 795528
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 58%
IMG OID:
Product: 8-amino-7-oxononanoate synthase
Protein accession: ACT42644
Protein GI: 253976974
COG category:
COG ID:
TIGRFAM ID:
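
The listed lengths can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 794374
END_BP = 795528

def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1

def protein_length(gene_len: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1

computed_gene = gene_length(START_BP, END_BP)
computed_protein = protein_length(computed_gene)
print(computed_gene, computed_protein)  # 1155 384
```

Both values agree with the listed gene length (1155 bp) and protein length (384 aa).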


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000953076
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCTGGC AGGATAAAAT CAACGCGGCC CTCGATGCGC GGCGTGCTGC CGATGCCCTG 
CGTCGCCGTT ATCCAGTGGC GCAAGGAGCA GGACGCTGGC TGGTGGCGGA CGATCGCCAG
TATCTGAACT TTTCCAGTAA CGATTATTTA GGTTTAAGCC ATCATCCGCA AATTATCCGT
GCCTGGAAGC AGAGTGCGGA GCAATTTGGC GTCGGTAGCG GCGGCTCCGG TCACGTCAGC
GGTTATAGCG TGGCACATCA GGCGCTGGAA GAAGAACTGG CCGAGTGGCT GGGCTATTCG
CGGGCACTGC TGTTTATCTC TGGTTTTGCC GCTAATCAGG CGGTCATTGC CGCGATGATG
GCGAAAGAGG ACCGTATTGT TGCCGACCGG CTTAGCCATG CCTCATTGCT GGAGGCTGCC
AGTTTAAGCC CGTCGCAGCT TCGCCGTTTT GTTCATAACG ATGTCACTCA TCTGGCGCGA
CTGCTTGCTT CCCCCTGTCC GGGGCAGCAA ATGGTGGTGA CAGAAGGCGT GTTCAGCATG
GACGGCGATA GTGCGCCACT GGCGGAAATC CAGCAGGTAA CGCAACAGCA CAATGGCTGG
TTGATGGTCG ATGATGCCCA CGGCACGGGC GTTATCGGGG AGCAGGGGCG CGGCAGCTGC
TGGCTGCAAA AGGTAAAACC AGAATTGCTG GTAGTGACTT TTGGCAAAGG ATTTGGCGTC
AGCGGGGCAG CGGTGCTTTG CTCCAGTACG GTGGCGGATT ATCTGCTGCA ATTCGCCCGC
CACCTTATCT ACAGCACCAG TATGCCGCCC GCTCAGGCGC AGGCATTACG TGCGTCGCTG
GCGGTCATTC GCAGTGATGA GGGTGATGCA CGGCGCGAAA AACTGGCGGC ACTCATTACG
CGTTTTCGTG CCGGAGTACA GGATTTGCCG TTTACGCTTG CTGATTCATG CAGCGCCATC
CAGCCATTGA TTGTCGGTGA TAACAGCCGT GCGTTACAAC TGGCAGAAAA ACTGCGCCAG
CAAGGCTGCT GGGTCACGGC GATTCGCCCG CCAACCGTAC CCGCTGGTAC TGCGCGACTG
CGCTTAACGC TAACCGCTGC GCATGAAATG CAGGATATCG ACCGTCTGCT GGAGGTGCTG
CATGGCAACG GTTAA
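
A GC-content figure such as the one in the record can be recomputed from the gene sequence. This is a minimal sketch that assumes GC content means the fraction of G and C bases over the whole sequence. `gc_content` is an illustrative helper, and the example uses only the first row of the sequence, so its result need not match the whole-gene value.

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C bases; whitespace between 10-base blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return gc / len(cleaned)

# First row of the gene sequence only; paste all rows for the full-gene value.
first_row = "ATGAGCTGGC AGGATAAAAT CAACGCGGCC CTCGATGCGC GGCGTGCTGC CGATGCCCTG"
print(f"{gc_content(first_row):.1%}")
```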
 
Protein sequence
MSWQDKINAA LDARRAADAL RRRYPVAQGA GRWLVADDRQ YLNFSSNDYL GLSHHPQIIR 
AWKQSAEQFG VGSGGSGHVS GYSVAHQALE EELAEWLGYS RALLFISGFA ANQAVIAAMM
AKEDRIVADR LSHASLLEAA SLSPSQLRRF VHNDVTHLAR LLASPCPGQQ MVVTEGVFSM
DGDSAPLAEI QQVTQQHNGW LMVDDAHGTG VIGEQGRGSC WLQKVKPELL VVTFGKGFGV
SGAAVLCSST VADYLLQFAR HLIYSTSMPP AQAQALRASL AVIRSDEGDA RREKLAALIT
RFRAGVQDLP FTLADSCSAI QPLIVGDNSR ALQLAEKLRQ QGCWVTAIRP PTVPAGTARL
RLTLTAAHEM QDIDRLLEVL HGNG
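
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below relies on translation table 11 having the same amino-acid assignments as the standard genetic code (the two differ in which start codons are permitted). `translate` is an illustrative helper, not a tool used by the database, and it only handles the unambiguous bases A, C, G and T.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a DNA string; whitespace is ignored, '*' marks a stop codon."""
    cleaned = "".join(seq.split()).upper()
    residues = []
    # Walk complete codons only; a trailing partial codon is dropped.
    for i in range(0, len(cleaned) - len(cleaned) % 3, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*" and to_stop:
            break
        residues.append(aa)
    return "".join(residues)

# First 30 bases and the final row of the gene sequence above.
print(translate("ATGAGCTGGC AGGATAAAAT CAACGCGGCC"))  # MSWQDKINAA
print(translate("CATGGCAACG GTTAA"))                  # HGNG
```

The two fragments reproduce the first ten and last four residues of the listed protein. Each full sequence row holds 60 bases, so every row starts in frame.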