Gene Information |
Locus tag | EcSMS35_2895 |
Symbol | |
ID | 6146384 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2966557 |
End bp | 2967828 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641617764 |
Product | pyridine nucleotide-disulphide oxidoreductase family protein |
Protein accession | YP_001744919 |
Protein GI | 170683360 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
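The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2966557
end_bp = 2967828
gene_length_bp = 1272
protein_length_aa = 423

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
remainder = span_bp % 3
total_codons = span_bp // 3

# Assumption: the final codon is a stop codon and encodes no residue.
coding_codons = total_codons - 1

print(span_bp, remainder, coding_codons)
```

Under these assumptions the span agrees with the listed gene length, and the codon count minus the stop agrees with the listed protein length.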
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGAAGACG ACTGCGACAT TATTATTATT GGTGCCGGTA TTGCAGGCAC CGCTTGCGCG TTACGCTGCG CGCGAGCGGG TTTATCCGTT TTGTTACTGG AACGCGCTGA AATCCCCGGC AGCAAAAATC TTTCCGGCGG GCGTTTATAT ACCCAGGCGC TCGCGGAACT CCTCCCACAA TTTCATCTGA CCGCGCCTCT TGAACGACGC ATCACTCACG AAAGCCTTTC CCTGTTAACG CCGGATGGCG CAACGACGTT TTCCAGCTTA CAGCCCGGCG GTGAATCCTG GAGTGTATTA CGTGCACGGT TCGATCCGTG GCTGGTTGCC GAAGCCGAAA AAGAAGGTGT CGAATGCATC CCCGGTGCGA CGGTGGATGC GCTGTATGAA GAAAACGGCA GGGTGTGTGG TGTCATTTGT GGTGACGATA TTCTCCGCGC CCGTTATGTG GTGCTGGCAG AAGGTGCCAA CAGCGTCCTG GCTGAACGTC ACGGGTTAGT GACTCGTCCT GCTGGCGAAG CGATGGCGTT GGGGATCAAA GAAGTGCTGT CGCTGGAAAC ATCCGCTATT GAAGAACGTT TTCATCTGGA GAATAACGAA GGCGCAGCGT TGCTGTTCAG CGGCGGGATC TGTGATGACT TACCCGGCGG CGCATTTCTT TATACTAATC AACAAACGCT CTCGTTAGGG ATTGTTTGCC CGCTCTCTTC ACTTACGCAA AGTCGTGTTC CGGCAAGCGA GCTGCTGGCT CGCTTTAAAA CGCATCCGGC AGTGCGCCCG CTTATCAAAA ACACGGAATC ACTGGAGTAT GGTGCGCATC TGGTGCCAGA AGGTGGCTTG CACAGTATGC AGGTGCAATA CGCCGGTAAC GGCTGGCTGC TGGTGGGCGA TACGTTGCGC AGTTGCGTCA ATACCGGAAT TTCCGTGCGC GGCATGGATA TGGCGCTAAC TGGCGCGCAG GCGGCGGCAC AAACGCTGAT AAGCGCCTGC CAGCACCGCG AGCCGCAAAA TCTGTTTGCG CTTTATCATC ACAACGTAGA GCGCAGCCTG CTGTGGGATG TTCTACAACG TTATCAGCAT GTTCCGGCGC TTTTGCAACG CCCTGGCTGG TATCGGGCGT GGCCTGCGTT AATGCAGGAT ATTTCCCGCG ATTTATGGGA TCAGGGTGAT AAACCTGTTC CACCGCTGCA CCAGTTATTC TGGCGTCATT TACGTCGTCA CGGCCTGTGG CATCTGGCGG GCGATGTTAT CAGGAGTCTG CGATGTCTGT AG
Protein sequence | MEDDCDIIII GAGIAGTACA LRCARAGLSV LLLERAEIPG SKNLSGGRLY TQALAELLPQ FHLTAPLERR ITHESLSLLT PDGATTFSSL QPGGESWSVL RARFDPWLVA EAEKEGVECI PGATVDALYE ENGRVCGVIC GDDILRARYV VLAEGANSVL AERHGLVTRP AGEAMALGIK EVLSLETSAI EERFHLENNE GAALLFSGGI CDDLPGGAFL YTNQQTLSLG IVCPLSSLTQ SRVPASELLA RFKTHPAVRP LIKNTESLEY GAHLVPEGGL HSMQVQYAGN GWLLVGDTLR SCVNTGISVR GMDMALTGAQ AAAQTLISAC QHREPQNLFA LYHHNVERSL LWDVLQRYQH VPALLQRPGW YRAWPALMQD ISRDLWDQGD KPVPPLHQLF WRHLRRHGLW HLAGDVIRSL RCL
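As a spot check that the gene and protein sequences agree, the sketch below translates the first 30 and the last 21 nucleotides of the gene sequence. It assumes that translation table 11 assigns the same amino acid to each codon as the standard table (the two differ in which start codons are permitted). The `translate` helper and the variable names are hypothetical and written for this illustration only.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
# Assumption: table 11 shares these codon-to-amino-acid assignments with table 1.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; spaces are ignored, '*' marks a stop."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-nt blocks of the gene sequence, as printed in the record.
first_30 = "ATGGAAGACG ACTGCGACAT TATTATTATT"
# Last 21 nt of the gene sequence (the final 22 characters minus the leading C).
last_21 = "AGGAGTCTGCGATGTCTGTAG"

n_term = translate(first_30)
c_term = translate(last_21)
print(n_term, c_term)
```

The N-terminal fragment should match the first ten residues of the protein sequence, and the C-terminal fragment should match its last six residues followed by a stop.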