Gene EcSMS35_0901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0901 
Symbolhcr 
ID6143830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp906081 
End bp907049 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content56% 
IMG OID641615789 
ProductHCP oxidoreductase, NADH-dependent 
Protein accessionYP_001742981 
Protein GI170683685 
COG category[C] Energy production and conversion 
COG ID[COG0633] Ferredoxin
[COG1018] Flavodoxin reductases (ferredoxin-NADPH reductases) family 1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.843947 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATGC CAACGAATCA ATGCCCGTGG CGGATGCAGG TTCATCACAT TACGCAAGAA 
ACGCCGGATG TGTGGACGAT TTCCCTGATT TGCCACGATT ACTACCCATA TCGCGCCGGG
CAATATGCAC TGGTCAGCGT GCGTAACTCA GCGGAAACGC TGCGTGCTTA CACCATTTCC
TCCACGCCAG GCGTGAGTGA ATACATCACC CTGACTGTGC GGCGGATTGA TGACGGTGTC
GGCTCCCGGT GGCTGACGCG CGATGTAAAA CGCGGTGATT ATCTCTGGCT TTCGGACGCG
ATGGGGGAAT TTACCTGCGA CGATAAAGCA GAAGATAAAT TCCTGTTGCT GGCGGCAGGC
TGCGGCGTTA CGCCGATTAT GTCGATGCGT CGCTGGCTGG CGAAGAACCG TCCACAGGCC
GATGTGCAGG TGATCTACAA CGTGCGTACG CCGCAGGATG TGATTTTCGC CGATGAGTGG
CGTAACTATC CGGTAACGCT GGTGGCGGAA AATAACGTTA CCGAAGGCTT TATCGCTGGT
CGTCTCACTC GCGAACTGCT GGCAGGTGTC CCTGATTTAG CCTCACGTAC CGTGATGACC
TGTGGCCCTG CTCCATATAT GGATTGGGTA GAGCAGGAAG TGAAAGCGCT CGGCGTGACG
CGTTTCTTTA AAGAGAAATT CTTCACCCCA GTAGCGGAAG CGGCGACCAG CGGTCTGAAA
TTCACCAAAC TGCAACCGGC ACGAGAATTT TACGCCCCGG TTGGCACCAC GCTACTGGAG
GCGCTGGAAA GCAATAACGT TCCGGTTGTC GCCGCCTGCC GCGCAGGTGT TTGCGGCTGC
TGTAAGACGA AAGTGGTTTC CGGTGAATAT ACGGTGAGCA GCACAATGAC GCTGACCGAC
GCCGAAATCG CTGAAGGTTA CGTACTGGCC TGCTCCTGCC ATCCGCAGGG GGATTTGGTT
CTCGCATAA
 
Protein sequence
MTMPTNQCPW RMQVHHITQE TPDVWTISLI CHDYYPYRAG QYALVSVRNS AETLRAYTIS 
STPGVSEYIT LTVRRIDDGV GSRWLTRDVK RGDYLWLSDA MGEFTCDDKA EDKFLLLAAG
CGVTPIMSMR RWLAKNRPQA DVQVIYNVRT PQDVIFADEW RNYPVTLVAE NNVTEGFIAG
RLTRELLAGV PDLASRTVMT CGPAPYMDWV EQEVKALGVT RFFKEKFFTP VAEAATSGLK
FTKLQPAREF YAPVGTTLLE ALESNNVPVV AACRAGVCGC CKTKVVSGEY TVSSTMTLTD
AEIAEGYVLA CSCHPQGDLV LA