Gene EcSMS35_2895 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2895 
Symbol 
ID6146384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2966557 
End bp2967828 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content55% 
IMG OID641617764 
Productpyridine nucleotide-disulphide oxidoreductase family protein 
Protein accessionYP_001744919 
Protein GI170683360 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGACG ACTGCGACAT TATTATTATT GGTGCCGGTA TTGCAGGCAC CGCTTGCGCG 
TTACGCTGCG CGCGAGCGGG TTTATCCGTT TTGTTACTGG AACGCGCTGA AATCCCCGGC
AGCAAAAATC TTTCCGGCGG GCGTTTATAT ACCCAGGCGC TCGCGGAACT CCTCCCACAA
TTTCATCTGA CCGCGCCTCT TGAACGACGC ATCACTCACG AAAGCCTTTC CCTGTTAACG
CCGGATGGCG CAACGACGTT TTCCAGCTTA CAGCCCGGCG GTGAATCCTG GAGTGTATTA
CGTGCACGGT TCGATCCGTG GCTGGTTGCC GAAGCCGAAA AAGAAGGTGT CGAATGCATC
CCCGGTGCGA CGGTGGATGC GCTGTATGAA GAAAACGGCA GGGTGTGTGG TGTCATTTGT
GGTGACGATA TTCTCCGCGC CCGTTATGTG GTGCTGGCAG AAGGTGCCAA CAGCGTCCTG
GCTGAACGTC ACGGGTTAGT GACTCGTCCT GCTGGCGAAG CGATGGCGTT GGGGATCAAA
GAAGTGCTGT CGCTGGAAAC ATCCGCTATT GAAGAACGTT TTCATCTGGA GAATAACGAA
GGCGCAGCGT TGCTGTTCAG CGGCGGGATC TGTGATGACT TACCCGGCGG CGCATTTCTT
TATACTAATC AACAAACGCT CTCGTTAGGG ATTGTTTGCC CGCTCTCTTC ACTTACGCAA
AGTCGTGTTC CGGCAAGCGA GCTGCTGGCT CGCTTTAAAA CGCATCCGGC AGTGCGCCCG
CTTATCAAAA ACACGGAATC ACTGGAGTAT GGTGCGCATC TGGTGCCAGA AGGTGGCTTG
CACAGTATGC AGGTGCAATA CGCCGGTAAC GGCTGGCTGC TGGTGGGCGA TACGTTGCGC
AGTTGCGTCA ATACCGGAAT TTCCGTGCGC GGCATGGATA TGGCGCTAAC TGGCGCGCAG
GCGGCGGCAC AAACGCTGAT AAGCGCCTGC CAGCACCGCG AGCCGCAAAA TCTGTTTGCG
CTTTATCATC ACAACGTAGA GCGCAGCCTG CTGTGGGATG TTCTACAACG TTATCAGCAT
GTTCCGGCGC TTTTGCAACG CCCTGGCTGG TATCGGGCGT GGCCTGCGTT AATGCAGGAT
ATTTCCCGCG ATTTATGGGA TCAGGGTGAT AAACCTGTTC CACCGCTGCA CCAGTTATTC
TGGCGTCATT TACGTCGTCA CGGCCTGTGG CATCTGGCGG GCGATGTTAT CAGGAGTCTG
CGATGTCTGT AG
 
Protein sequence
MEDDCDIIII GAGIAGTACA LRCARAGLSV LLLERAEIPG SKNLSGGRLY TQALAELLPQ 
FHLTAPLERR ITHESLSLLT PDGATTFSSL QPGGESWSVL RARFDPWLVA EAEKEGVECI
PGATVDALYE ENGRVCGVIC GDDILRARYV VLAEGANSVL AERHGLVTRP AGEAMALGIK
EVLSLETSAI EERFHLENNE GAALLFSGGI CDDLPGGAFL YTNQQTLSLG IVCPLSSLTQ
SRVPASELLA RFKTHPAVRP LIKNTESLEY GAHLVPEGGL HSMQVQYAGN GWLLVGDTLR
SCVNTGISVR GMDMALTGAQ AAAQTLISAC QHREPQNLFA LYHHNVERSL LWDVLQRYQH
VPALLQRPGW YRAWPALMQD ISRDLWDQGD KPVPPLHQLF WRHLRRHGLW HLAGDVIRSL
RCL