Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4056 |
Symbol | |
ID | 6145930 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4146121 |
End bp | 4147206 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641618880 |
Product | putative oxidoreductase |
Protein accession | YP_001746018 |
Protein GI | 170681012 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATGAATG TGAGTGAAAA GATGGAACAT TTCGACGTGG CGATTATTGG CCTCGGCCCG GCAGGGTCGG TGTTGGCACG AACGTTAGCC AGCAAAATGC AGGTGATCGC GCTGGATAAA AAGCACCAGT GTGGTACTGA AGGTTTCAGC AAACCCTGCG GCGGTCTGCT GGCACCGGAC GCGCAGCGAT CTTTTATTCG CGATGGACTG ACGCTTCCTG TCGATGTGAT CGCCAATCCA CAGATTTTCA GCGTCAAAAC TGTCGACGTC GCCGCATCGC TCACGCGTAA CTACCAGCGA AGCTATATCA ATATTAATCG CCATGCTTTC GACTTGTGGA TGAAATCGCT GATCCCCGCC AGCGTTGAGG TTTACCACGA CAGCCTGTGC CGAAAAATCT GGCGTGAGGA TGATAAATGG CATGTCATTT TTCGTGCAGA CGGTTGGGAG CAGCATATTA CTGCCCGCTA TCTGGTCGGT GCCGATGGCG CAAACTCGAT GGTGCGGCGA CATCTCTACC CGGATCATCA AATTCGTAAA TATGTCGCTA TCCAGCAGTG GTTCGCAGAG AAACATCCGG TGCCGTTCTA CTCCTGCATC TTTGATAATG CGATAACTGA CTGTTACTCA TGGAGTATCA GCAAAGACGG TTATTTTATC TTTGGCGGTG CCTATCCAAT GAAAGACGGT CAGACGCGTT TCGCGACGCT GAAAGAGAAA ATGAGCGCCT TTCAGTTCCA GTTTGGCAAA GCGGTGAAAA GTGAGAAATG CACGGTGCTG TTTCCCTCAC GCTGGCAGGA TTTTGTCTGC GGTAAGGACA ATGCCTTTCT GATTGGTGAA GCGGCGGGAT TTATCAGCGC CAGCTCGCTG GAGGGGATTA GCTATGCGCT GGATAGCGCA GAGATTCTGC GTTCGGTGTT ACTGAAGCAG CCAGAAAAGC TCAATACCGC TTACTGGCGC GCCACCCGCA AACTGCGTTT AAAGCTCTAT GGCAAGATAG TCAAAAGCCG ATGCCTGACC GCACCGGCTT TAAGAAAGTG GATTATGCGC AGTGGTGTGG CGCATATTCC ACAGTTGAAA GATTAG
|
Protein sequence | MMNVSEKMEH FDVAIIGLGP AGSVLARTLA SKMQVIALDK KHQCGTEGFS KPCGGLLAPD AQRSFIRDGL TLPVDVIANP QIFSVKTVDV AASLTRNYQR SYININRHAF DLWMKSLIPA SVEVYHDSLC RKIWREDDKW HVIFRADGWE QHITARYLVG ADGANSMVRR HLYPDHQIRK YVAIQQWFAE KHPVPFYSCI FDNAITDCYS WSISKDGYFI FGGAYPMKDG QTRFATLKEK MSAFQFQFGK AVKSEKCTVL FPSRWQDFVC GKDNAFLIGE AAGFISASSL EGISYALDSA EILRSVLLKQ PEKLNTAYWR ATRKLRLKLY GKIVKSRCLT APALRKWIMR SGVAHIPQLK D
|
| |