Gene B21_02134 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_02134 
SymbolyfaY 
ID8114725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp2251914 
End bp2253116 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content54% 
IMG OID644848341 
Producthypothetical protein 
Protein accessionYP_002999914 
Protein GI251785610 
COG category[R] General function prediction only 
COG ID[COG1058] Predicted nucleotide-utilizing enzyme related to molybdopterin-biosynthesis enzyme MoeA 
TIGRFAM ID[TIGR00177] molybdenum cofactor synthesis domain
[TIGR00200] competence/damage-inducible protein CinA N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.495097 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAAAAG TGGAAATGTT ATCCACCGGG GATGAAGTGT TACACGGGCA AATCGTTGAC 
ACTAACGCTG CCTGGCTGGC CGATTTTTTC TTTCATCAGG GGTTGCCATT ATCTCGCCGC
AATACGGTGG GGGATAACCT TGATGACTTA GTCACCATTC TTCGCGAACG TAGTCAGCAC
GCCGATGTGC TGATCGTTAA CGGCGGGCTG GGACCGACCA GCGATGATTT AAGCGCACTC
GCCGCTGCGA CAGCAAAAGG TGAAGGCCTG GTGCTGCATG AAGCCTGGCT CAAAGAGATG
GAACGCTATT TCCACGAACG TGGACGAGTA ATGGCACCGA GCAACCGTAA ACAAGCGGAG
CTGCCTGCCA GTGCTGAATT TATCAATAAC CCGGTAGGCA CCGCCTGTGG TTTTGCCGTG
CAGCTTAATC GTTGCCTGAT GTTCTTTACT CCCGGCGTAC CGTCAGAATT TAAGGTGATG
GTCGAGCACG AAATCCTGCC GCGCCTGCGC GAGCGTTTTT CTTTACCGCA GCCGCCGGTT
TGTCTGCGTT TGACTACTTT TGGTCGTTCG GAAAGCGATC TGGCACAAAG CCTGGACACT
CTACAACTGC CGCCGGGCGT AACAATGGGC TATCGCTCCT CAATGCCTAT CATCGAACTG
AAACTCACCG GACCGGCAAG CGAGCAACAG GCGATGGAAA AACTGTGGCT GGATGTTAAA
CGTGTTGCCG GACAGAGCGT GATTTTCGAA GGCACTGAAG GACTGCCCGC GCAGATCAGT
CGCGAATTGC AAAACCGCCA GTTCAGCCTG ACGTTGAGCG AGCAATTCAC CGGTGGTTTA
TTGGCTTTGC AACTTTCTCG CGCAGGTGCT CCATTGCTGG CGTGTGAAGT GGTTCCTTCA
CAGGAGGAAA CCCTGGCGCA AACTGCGCAC TGGATTACAG AACGGCGGGC CAACCATTTT
GCCGGGCTGG CACTGGCTGT TTCGGGTTTC GAGAACGAGC ATCTCAACTT TGCGCTAGCC
ACGCCAGACG GCACTTTCGC TCTGCGTGTG CGTTTCAGCA CTACGCGCTA CAGCCTGGCT
ATCCGTCAGG AAGTGTGCGC AATGATGGCA CTGAATATGC TGCGCCGTTG GTTAAACGGC
CAGGATATCG CCAGTGAGCA TGGCTGGATT GAGGTTGTTG AGTCCATGAC CTTATCTGTC
TGA
 
Protein sequence
MLKVEMLSTG DEVLHGQIVD TNAAWLADFF FHQGLPLSRR NTVGDNLDDL VTILRERSQH 
ADVLIVNGGL GPTSDDLSAL AAATAKGEGL VLHEAWLKEM ERYFHERGRV MAPSNRKQAE
LPASAEFINN PVGTACGFAV QLNRCLMFFT PGVPSEFKVM VEHEILPRLR ERFSLPQPPV
CLRLTTFGRS ESDLAQSLDT LQLPPGVTMG YRSSMPIIEL KLTGPASEQQ AMEKLWLDVK
RVAGQSVIFE GTEGLPAQIS RELQNRQFSL TLSEQFTGGL LALQLSRAGA PLLACEVVPS
QEETLAQTAH WITERRANHF AGLALAVSGF ENEHLNFALA TPDGTFALRV RFSTTRYSLA
IRQEVCAMMA LNMLRRWLNG QDIASEHGWI EVVESMTLSV