Gene EcSMS35_2695 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2695 
SymbolhcaD 
ID6142684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2767381 
End bp2768583 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content55% 
IMG OID641617566 
Productphenylpropionate dioxygenase ferredoxin reductase subunit 
Protein accessionYP_001744731 
Protein GI170683500 
COG category[R] General function prediction only 
COG ID[COG0446] Uncharacterized NAD(FAD)-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGAAA AAACGATCAT TATTGTCGGT GGCGGGCAAG CGGCGGCAAT GGCTGCGGCC 
TCGCTACGCC AGCAAGGGTT CACCGGTGAG CTGCATCTGT TTTCCGATGA GCAACATCTT
CCTTATGAAC GCCCCCCGCT CTCGAAATCC ATGTTGCTGG AAGATTCCCC ACAGTTGCAG
TCTGTGTTAC CCGCTCACTG GTGGCAGGAA AACAATGTTC ATCTGCATTC CGGTGTAACC
ATCAAAACAT TGGGCCGCGA CACACGAGAG TTAGTGTTAG CTAACGGCGA AAGCTGGCAC
TGGGATCAGC TTTTTATAGC AACCGGCGCG GCAGCCAGAC CGCTGCCGTT GCTTGATGCA
CTGGGAGAAC GCTGCTTTAC TCTGCGCCAT GCCGGCGATG CCGCCAGACT GCGAGAAGTT
CTGCAGCCCG AACGGTCAGT CGTGATTGTC GGTGCCGGAA CTATTGGTCT GGAACTGGCT
GCCAGCGCCA CGCAGCGTGG ATGTAAGGTG ACAGTGATTG AACTGGCGGC AACCGTCATG
GGCCGTAATG CACCACCGCC CGTGCAACAC TATCTTTTAC AGCGCCATCA GCAGGCTGGT
GTGCGCATTC TGCTCAATAA TGCCATTGAA CATGTGGTCG ATGGTGAAAA CGTAGAACTG
ACGCTGCAAA GTGGCGAAAC GCTTCGGGCC GATGTGGTGA TTTACGGTAT TGGTATCAGC
GCCAACGACC AACTGGCTCG CGAGGCCAAC CTTGATACTG CCAATGGCAT TGTCATTGAT
GAGGCTTGCC GCACCTGCGA TCCCGCGATC TTTGCCGGTG GCGATGTGGC AATCACCCGT
CTTGATAATG GTGCACTACA CCGCTGCGAA AGCTGGGAAA ACGCCAATAA CCAGGCGCAA
ATTGCCGCTT CCGCAATGTT GGGGCTACCG CTTCCGCGAC TGCCGCCGCC GTGGTTCTGG
AGCGATCAGT ACAGTGATAA CTTACAGTTT ATTGGCGATA TGCATGGCGA TGACTGGCTT
TGTCGTGGCA ACCCGGAAAC TCAGAAGGCG ATTTGGTTTA ATCTGCAAAA CGGCGTGCTT
ATCGGTGCAG TCACGCTGAA TCAGGGCCGT GAGATTCGCC CAATCCGCAA ATGGATCCAG
AGCGGCAAAA CGTTTGATGC GAAACTGCTG ACAGATGAGG ACATCGCGCT TAAATCACTG
TAA
 
Protein sequence
MKEKTIIIVG GGQAAAMAAA SLRQQGFTGE LHLFSDEQHL PYERPPLSKS MLLEDSPQLQ 
SVLPAHWWQE NNVHLHSGVT IKTLGRDTRE LVLANGESWH WDQLFIATGA AARPLPLLDA
LGERCFTLRH AGDAARLREV LQPERSVVIV GAGTIGLELA ASATQRGCKV TVIELAATVM
GRNAPPPVQH YLLQRHQQAG VRILLNNAIE HVVDGENVEL TLQSGETLRA DVVIYGIGIS
ANDQLAREAN LDTANGIVID EACRTCDPAI FAGGDVAITR LDNGALHRCE SWENANNQAQ
IAASAMLGLP LPRLPPPWFW SDQYSDNLQF IGDMHGDDWL CRGNPETQKA IWFNLQNGVL
IGAVTLNQGR EIRPIRKWIQ SGKTFDAKLL TDEDIALKSL