Gene B21_02978 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_02978 
SymbolyhbW 
ID8114024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp3171154 
End bp3172161 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content56% 
IMG OID644849163 
Producthypothetical protein 
Protein accessionYP_003000736 
Protein GI251786432 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID[TIGR03558] luciferase family oxidoreductase, group 1 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGATA AAACCATTGC GTTTTCGCTA CTCGATCTGG CCCCCATTCC CGAAGGTTCT 
TCAGCGCGAG AAGCATTCTC CCACTCTCTC GATCTCGCCC GTCTGGCTGA AAAGCGCGGC
TATCATCGCT ACTGGCTGGC AGAACACCAC AATATGACTG GCATTGCCAG TGCTGCCACG
TCGGTATTGA TCGGCTATCT GGCGGCGAAT ACCACCACGC TGCATCTGGG GTCTGGCGGC
GTGATGTTGC CTAACCACTC ACCGTTGGTC ATTGCAGAAC AGTTCGGCAC GCTTAATACA
CTCTATCCGG GGCGAATCGA TTTGGGGCTG GGTCGTGCTC CGGGTAGTGA CCAACGGACA
ATGATGGCGC TACGTCGTCA TATGAGCGGC GATATTGATA ATTTCCCCCG CGATGTGGCG
GAGCTGGTGG ACTGGTTTGA CGCCCGCGAT CCCAATCCGC ATGTGCGCCC GGTACCAGGC
TATGGCGAGA AAATCCCCGT GTGGTTGTTA GGCTCCAGCC TTTACAGCGC GCAACTGGCG
GCGCAGCTTG GTCTGCCGTT TGCGTTTGCC TCACACTTCG CGCCGGATAT GCTGTTCCAG
GCGCTGCATC TTTATCGCAG CAACTTCAAA CCGTCAGCAC GGCTGGAAAA ACCATACGCG
ATGGTGCGCA TCAATATTAT CGCCGCCGAC AGCAACCGCG ACGCTGAATT TCTGTTTACC
TCAATGCAGC AAGCCTTTGT GAAGCTGCGC CGTGGCGAAA CCGGGCAACT GCCGCCGCCG
ATTCAAAATA TGGATCAGTT CTGGTCACCG TCTGAGCAGT ATGGCGTGCA GCAGGCGCTG
AGTATGTCGT TGGTAGGTGA TAAAGCGAAA GTGCGTCATG GCTTGCAGTC GATCCTGCGC
GAAACCGACG CCGATGAGAT TATGGTCAAC GGGCAGATTT TCGACCACCA GGCGCGGCTG
CATTCGTTTG AGCTGGCGAT GGATGTTAAG GAAGAGTTGT TGGGATAG
 
Protein sequence
MTDKTIAFSL LDLAPIPEGS SAREAFSHSL DLARLAEKRG YHRYWLAEHH NMTGIASAAT 
SVLIGYLAAN TTTLHLGSGG VMLPNHSPLV IAEQFGTLNT LYPGRIDLGL GRAPGSDQRT
MMALRRHMSG DIDNFPRDVA ELVDWFDARD PNPHVRPVPG YGEKIPVWLL GSSLYSAQLA
AQLGLPFAFA SHFAPDMLFQ ALHLYRSNFK PSARLEKPYA MVRINIIAAD SNRDAEFLFT
SMQQAFVKLR RGETGQLPPP IQNMDQFWSP SEQYGVQQAL SMSLVGDKAK VRHGLQSILR
ETDADEIMVN GQIFDHQARL HSFELAMDVK EELLG