Gene EcSMS35_0799 details

Gene Information

Locus tag: EcSMS35_0799
Symbol: bioF
ID: 6144826
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 802229
End bp: 803383
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 58%
IMG OID: 641615687
Product: 8-amino-7-oxononanoate synthase
Protein accession: YP_001742879
Protein GI: 170680137
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0156] 7-keto-8-aminopelargonate synthetase and related enzymes
TIGRFAM ID: [TIGR00858] 8-amino-7-oxononanoate synthase
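
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS whose last codon is a stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the untranslated stop codon


# Values taken from the Gene Information fields of this record.
length_bp = gene_length(802229, 803383)
print(length_bp)                  # 1155
print(protein_length(length_bp))  # 384
```

Both results match the Gene Length (1155 bp) and Protein Length (384 aa) reported in the record.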


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000322202
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCTGGC AGGAGAAAAT CAACGCGGCG CTCGATGCGC GGCGTGCTGC CGATGCCCTG 
CGTCGCCGTT ATCCAGTGGC GCAAGGAGCC GGACGCTGGC TGGTGGCGGA CGATCGCCAT
TATCTGAACT TTTCCAGTAA CGATTATTTA GGTTTAAGCC ATCATCCGCA AATTATCCGT
GCCTGGAAGC TGAGTGCGGA GCAATTTGGC GTCGGTAGCG GCGGCTCCGG TCACGTCAGC
GGTTATAGCG TGGCGCATCA GGCGCTGGAA GAAGAACTGG CCGAGTGGCT GGGCTATTCG
CGGGCACTGC TGTTTATCTC TGGTTTTGCC GCTAACCAGG CAGTTATTAC CGCGATGATG
GCGAAAGAGG ACCGTATTGT TGCCGACCGG CTTAGCCATG CCTCATTGCT GGAGGCTGCA
AGTTTAAGCC CGTCGCCGCT TCGCCGTTTT GCTCATAACG ATGTCACTCA TCTGGCGCGA
CTGCTTGCTT CCCCCTGTCC GGGGCAGCAA CTGGTAGTGA CAGAAGGCGT GTTCAGCATG
GACGGCGATA GTGCGCCACT GGAGGAAATC CAGCAGGTAA CGCAACAGCA CGATGGCTGG
TTGATGGTCG ATGACGCCCA CGGCACGGGC GTTATCGGGG AGCAGGGGCG TGGCAGCTGC
TGGCTGCAAA AGGTAAAACC AGAATTGCTG GTGGTGACTT TTGGCAAAGG ATTTGGCGTC
AGCGGGGCAG CGGTGCTTTG CTCCAATACG GTGGCGGATT ATCTGCTGCA ATTCGCCCGC
CATCTTATCT ACAGCACCAG TATGCCGCCC GCTCAGGCGC AGGCATTACG TGCGTCGCTG
GCGGTCATTC GCAGTGATGA GGGTGATGCA CGGCGCGAAA AACTGGCGGC ACTCATTACG
CGTTTTCGTG CCGGAGTGCA GGATTTGCCG TTTACGCTTG CTGATTCATG GAGCGCCATC
CAGCCATTGA TCGTCGGTGA TAACAGCCGT GCGTTACAAC TGGCAGAAAA ACTGCGCCAG
CAAGGTTGCT GGGTCACGGC GATTCGCCCG CCAACCGTAC CTGCTGGTAC TGCGCGACTG
CGCTTAACAC TAACCGCCGC GCATGAAATG CAGGATATCG ACCGTCTGCT GGAGGTGCTG
CATGACAACG GTTAA
 
Protein sequence
MSWQEKINAA LDARRAADAL RRRYPVAQGA GRWLVADDRH YLNFSSNDYL GLSHHPQIIR 
AWKLSAEQFG VGSGGSGHVS GYSVAHQALE EELAEWLGYS RALLFISGFA ANQAVITAMM
AKEDRIVADR LSHASLLEAA SLSPSPLRRF AHNDVTHLAR LLASPCPGQQ LVVTEGVFSM
DGDSAPLEEI QQVTQQHDGW LMVDDAHGTG VIGEQGRGSC WLQKVKPELL VVTFGKGFGV
SGAAVLCSNT VADYLLQFAR HLIYSTSMPP AQAQALRASL AVIRSDEGDA RREKLAALIT
RFRAGVQDLP FTLADSWSAI QPLIVGDNSR ALQLAEKLRQ QGCWVTAIRP PTVPAGTARL
RLTLTAAHEM QDIDRLLEVL HDNG
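
The sequences above are printed in space-separated blocks of ten, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, plus a GC-fraction calculation and a simplified CDS sanity check. All function names are hypothetical, and the check only accepts an ATG start, although translation table 11 also permits alternative start codons.

```python
def clean_sequence(text):
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


def looks_like_cds(dna, stop_codons=("TAA", "TAG", "TGA")):
    """Simplified check: whole codons, ATG start, recognised stop codon."""
    return (
        len(dna) % 3 == 0
        and dna.startswith("ATG")
        and dna[-3:] in stop_codons
    )


# Toy input in the same blocked layout as the record, not the full gene.
toy = clean_sequence("ATGAGCTGGC\nGTTAA")
print(toy)                  # ATGAGCTGGCGTTAA
print(looks_like_cds(toy))  # True
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_fraction` gives a value that can be compared with the GC content field of the record.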