Gene Information |
Locus tag | SbBS512_E3612 |
Symbol | |
ID | 6272162 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 3364906 |
End bp | 3365913 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641727481 |
Product | hypothetical protein |
Protein accession | YP_001881923 |
Protein GI | 187733478 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | [TIGR03558] luciferase family oxidoreductase, group 1 |
| |
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGATA AAACCATTGC GTTTTCGCTA CTCGATCTGG CCCCCATTCC CGAAGGTTCT TCAGCGCGAG AAGCATTCTC CCACTCTCTC GATCTCGCCC GTCTGGCTGA AAAGCGCGGC TATCATCGCT ACTGGCTGGC AGAACACCAC AATATGACTG GCATTGCCAG TGCTGCCACG TCGGTATTGA TTGGCTATCT GGCGGCGAAT ACCACCACGC TGCATCTGGG GTCTGGCGGC GTGATGTTGC CTAACCACTC ACCGTTGGTC ATTGCCGAAC AGTTCGGCAC GCTCAATACA CTCTATCCGG GGCGAATCGA TTTGGGGCTG GGGCGTGCGT CAGGTAGTGA CCAACGGACA ATGATGGCGC TGCGTCGCCA TATGAGCGGC GATATTGATA ATTTCCCCCG CGATGTCGCG GAGCTGGTGG ACTGGTTTGA CGCCCGCGAT CCCAATCCGC ATGTGCGCCC GGTACCAGGC TATGGCGAGA AAATCCCCGT GTGGTTGTTA GGCTCCAGCC TTTACAGCGC GCAACTGGCG GCGCAGCTTG GTCTGCCGTT CGCGTTTGCC TCACACTTCG CGCCGGATAT GTTGTTCCAG GCGCTGCATC TTTATCGCAG CAACTTCAAA CCGTCGGCAC GACTGGAAAA ACCATACGCG ATGGTGTGCA TCAATATTAT CGCCGCCGAC AGCAACCGCG ATGCCGAATT CCTGTTTACC TCAATGCAGC AAGCCTTTGT GAAGCTGCGC CGCGGCGAAA CCGGGCAACT GCCGCCACCG ATTCAAAATA TGGATCAGTT CTGGTCGCCG TCCGAACAGT ATGGTGTGCA GCAGGCGCTG AGTATGTCGT TGGTAGGTGA TAAAGCGAAA GTGCGTCATG GCTTGCAGTC GATCCTGCGC GAAACCGACG CTGATGAGAT TATGGTCAAC GGGCAGATTT TCGACCACCA GGCGCGGCTG CATTCGTTTG AGCTGGCGAT GGATGTTAAG GAAGAGTTGT TGGGATAG
|
Protein sequence | MTDKTIAFSL LDLAPIPEGS SAREAFSHSL DLARLAEKRG YHRYWLAEHH NMTGIASAAT SVLIGYLAAN TTTLHLGSGG VMLPNHSPLV IAEQFGTLNT LYPGRIDLGL GRASGSDQRT MMALRRHMSG DIDNFPRDVA ELVDWFDARD PNPHVRPVPG YGEKIPVWLL GSSLYSAQLA AQLGLPFAFA SHFAPDMLFQ ALHLYRSNFK PSARLEKPYA MVCINIIAAD SNRDAEFLFT SMQQAFVKLR RGETGQLPPP IQNMDQFWSP SEQYGVQQAL SMSLVGDKAK VRHGLQSILR ETDADEIMVN GQIFDHQARL HSFELAMDVK EELLG
|
| |
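
The coordinate, length, and GC content fields above are related by simple arithmetic. The following minimal sketch shows one way to cross-check them. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene's final codon is a stop codon that is not counted in the protein length. The record does not state either convention, although its numbers are consistent with both. The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base groups is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Values copied from the Start bp and End bp fields of this record.
    length = gene_length(3364906, 3365913)
    print(length)                  # 1008, matching the Gene Length field
    print(protein_length(length))  # 335, matching the Protein Length field
```

To compare against the GC content field, pass the full Gene sequence string to `gc_percent` and round the result. The grouped, space-separated layout used in this record can be passed in unchanged.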