Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3456 |
Symbol | |
ID | 6143348 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3530203 |
End bp | 3531210 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641618285 |
Product | hypothetical protein |
Protein accession | YP_001745434 |
Protein GI | 170679737 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | [TIGR03558] luciferase family oxidoreductase, group 1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGATA AAACCATTGC GTTTTCGCTA CTCGATCTGG CCCCCATCCC CGAAGGCTCT TCAGCGCGAG AAGCGTTCTC CCACTCTCTC GATCTCGCCC GTCTGGCTGA AAAGCACGGC TATCATCGCT ACTGGCTGGC AGAACACCAC AATATGACCG GTATTGCCAG TGCTGCAACG TCGGTGTTAA TTGGCTATCT GGCGGCGAAT ACCACCACGC TGCATCTGGG GTCTGGCGGC GTGATGTTGC CTAACCACTC ACCGTTGGTG ATCGCCGAAC AATTCGGCAC GCTCAATACG CTCTATCCGG GGCGAATCGA TTTGGGGCTG GGCCGCGCGC CGGGTAGCGA TCAGCGAACC ATGATGGCGC TGCGTCGTCA TATGAGCGGC GATATTGATA ATTTCCCCCG CGATGTCGCG GAACTGGTGG ACTGGTTTGA CGCCCGCGAT CCCAATCCGC ATGTGCGCCC GGTACCAGGC TATGGCGAGA AAATCCCCGT GTGGTTGTTA GGCTCCAGCC TTTACAGCGC GCAACTGGCG GCGCAGCTTG GTCTGCCGTT TGCGTTTGCC TCACACTTCG CGCCGGATAT GTTGTTCCAG GCGCTGCATC TTTATCGCAG CAACTTCAAA CCGTCGGCAC GGTTGGAGAA ACCGTATGCG ATGGTGTGCA TCAATATTAT CGCCGCCGAC AGCAACCGCG ATGCCGAATT CCTGTTTACC TCAATGCAGC AAGCCTTTGT GAAGCTGCGC CGCGGCGAAA CCGGGCAACT GCCGCCACCG ATTCAAAATA TGGATCAGTT CTGGTCGCCG TCCGAACAGT ATGGTGTGCA GCAGGCGCTG AGTATGTCGC TGGTGGGCGA TAAAGCGAAA GTGCGTCATG GCTTGCAGTC GATCCTGCGC GAAACCGACG CCGATGAAAT TATGGTTAAC GGGCAGATTT TCGACCACCA GGCGCGCCTG CATTCGTTTG AGCTGGTGAT GGATGTGAAG GAAGAATTAT TGGGGTAG
|
Protein sequence | MTDKTIAFSL LDLAPIPEGS SAREAFSHSL DLARLAEKHG YHRYWLAEHH NMTGIASAAT SVLIGYLAAN TTTLHLGSGG VMLPNHSPLV IAEQFGTLNT LYPGRIDLGL GRAPGSDQRT MMALRRHMSG DIDNFPRDVA ELVDWFDARD PNPHVRPVPG YGEKIPVWLL GSSLYSAQLA AQLGLPFAFA SHFAPDMLFQ ALHLYRSNFK PSARLEKPYA MVCINIIAAD SNRDAEFLFT SMQQAFVKLR RGETGQLPPP IQNMDQFWSP SEQYGVQQAL SMSLVGDKAK VRHGLQSILR ETDADEIMVN GQIFDHQARL HSFELVMDVK EELLG
|
| |