Gene Information |
Locus tag | EcHS_A3902 |
Symbol | |
ID | 5592377 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 3896548 |
End bp | 3897648 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640923010 |
Product | putative oxidoreductase |
Protein accession | YP_001460487 |
Protein GI | 157163169 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
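The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this, but the arithmetic is consistent with it) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3896548
end_bp = 3897648
gene_length_bp = 1101
protein_length_aa = 366


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record: 3897648 - 3896548 + 1 = 1101,
# and 1101 / 3 = 367 codons, of which one is the stop codon.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

A check like this is mainly useful for catching off-by-one errors when coordinates are moved between 0-based and 1-based conventions.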
Plasmid Coverage information |
Num covering plasmid clones | 77 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACATT TCGACGTGGC GATTATTGGC CTCGGCCCGG CAGGGTCGGC GTTGGCACGA AAGTTAGCCG GCAAAATGCA GGTGATCGCG CTGGATAAAA AGCACCAGTG TGGTACTGAA GGTTTCAGCA AACCTTGTGG CGGTCTGCTG GCACCGGACG CGCAGCGTTC TTTTATTCGC GATGGACTGA CGCTCCCTGT CGATGTGATC GCCAATCCGC AGATTTTCAG CGTCAAAACC GTCGACGTCG CCGCATCGCT CACACGTAAC TACCAGCGAA GCTATATCAA TATTAATCGC CACGCTTTCG ACTTGTGGAT GAAATCACTG ATCCCCGCCA GCGTTGAGGT TTACCACGAT AGCCTGTGCC GGAAAATCTG GCGTGAGGAT GATAAATGGC ATGTCATTTT TCGTGCAGAC GGTTGGGAGC AGCATATTTC CGCCCGCTAT CTGGTCGGTG CCGATGGTGC CAACTCGATG GTGCGGCGAC ATCTCTACCC GGATCATCAA ATCCGTAAAT ATGTCGCTAT CCAGCAGTGG TTTGCAGAGA AACATCCGGT ACCGTTCTAC TCCTGCATCT TTGATAATGA AATAACTGAC TGTTATTCAT GGAGTATCAG CAAAGACGGT TATTTTATCT TTGGCGGTGC TTATCCAATG AAAGACGGTC AGACGCGTTT CACGACGCTG AAAGAGAAAA TGAGCGCCTT TCAGTTCCAG TTTGGTAAGG CGGTGAAAAG CGAAAAATGC ACGGTGCTGT TTCCCTCGCG CTGGCAGGAT TTTGTCTGCG GTAAGGACAA CGCCTTTCTG ATTGGCGAAG CGGCAGGATT TATCAGCGCC AGCTCGCTGG AGGGGATTAG CTATGCGCTG GATAGCGCAG AGATTCTGCG TGCGGTGTTA CTGAAGCAGC CGGAGAAGAG CAACGCCGCC TACTGGCGCG CCACCCGCAA ACTGCGTTTA AAACTCTTCG GCAAGATAGT AAAAAGCCGA TGCCTGACCG CACCGGCTTT AAGAAAGTGG ATTATGCGCA GTGGTGTGGC GCATATTCCA CAGTTGAAAG ATTATCCAAC GCGCTTCACA TCGCCCACCA GCAGGATGTA A
|
Protein sequence | MEHFDVAIIG LGPAGSALAR KLAGKMQVIA LDKKHQCGTE GFSKPCGGLL APDAQRSFIR DGLTLPVDVI ANPQIFSVKT VDVAASLTRN YQRSYININR HAFDLWMKSL IPASVEVYHD SLCRKIWRED DKWHVIFRAD GWEQHISARY LVGADGANSM VRRHLYPDHQ IRKYVAIQQW FAEKHPVPFY SCIFDNEITD CYSWSISKDG YFIFGGAYPM KDGQTRFTTL KEKMSAFQFQ FGKAVKSEKC TVLFPSRWQD FVCGKDNAFL IGEAAGFISA SSLEGISYAL DSAEILRAVL LKQPEKSNAA YWRATRKLRL KLFGKIVKSR CLTAPALRKW IMRSGVAHIP QLKDYPTRFT SPTSRM
|
| |
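The sequences above are printed in blocks of ten characters, so the spaces need to be stripped before any analysis. The sketch below shows one way to do that, compute a GC fraction, and translate a nucleotide sequence. It builds the codon table by hand rather than using a library; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, which does not matter here because this gene begins with ATG. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = "ATGGAACATT TCGACGTGGC GATTATTGGC"
print(translate(prefix))  # MEHFDVAIIG, the first block of the protein sequence
```

Passing the full Gene sequence field to `translate` and comparing the result with the cleaned Protein sequence field, or passing it to `gc_fraction` and comparing with the GC content field, gives a quick consistency check on the record.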