Gene B21_00274 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_00274 
SymbolyahA 
ID8113046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp301992 
End bp303080 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content46% 
IMG OID644846563 
Producthypothetical protein 
Protein accessionYP_002998136 
Protein GI251783832 
COG category[T] Signal transduction mechanisms 
COG ID[COG2200] FOG: EAL domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.725315 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTCAT GTGATTTTCG TGTTTTTCTG CAAGAGTTCG GTACAACGGT TCATTTGTCA 
TTGCCTGGTA GCGTATCCGA GAAAGAACGA CTGCTACTCA AGCTGCTGAT GCAGGGAATG
TCTGTAACAG AAATATCACA GTACAGAAAT CGCAGTGCAA AGACAATTTC ACATCAAAAG
AAACAGCTCT TTGAGAAACT GGGGATTCAG AGCGATATTA CTTTCTGGCG CGATATTTTC
TTTCAGTACA ATCCGGAGAT CATATCCGCC ACGGGGAGTA ATAGTCACAG ATATATTAAT
GATAATCACT ATCACCATAT CGTCACGCCT GAAGCCATCA GTCTGGCGTT GGAAAACCAC
GAATTCAAAC CGTGGATCCA ACCGGTTTTC TGCGCGCAGA CTGGGGTACT GACGGGCTGT
GAGGTGCTTG TCCGCTGGGA ACATCCACAA ACGGGAATTA TCCCACCGGA TCAGTTTATT
CCTCTGGCGG AGTCATCCGG TCTTATTGTC ATAATGACCC GCCAACTGAT GAAACAGACT
GCGGATATTC TGATGCCGGT AAAACATTTG CTGCCGGACA ATTTCCATAT TGGCATCAAC
GTCTCGGCGG GTTGTTTTTT GGCAGCGGGA TTTGAAAAAG AGTGTCTGAA CCTGGTTAAT
AAATTAGGTA ACGATAAAAT CAAGCTGGTT CTCGAGCTAA CGGAACGTAA CCCTATTCCG
GTAACGCCAG AAGCCAGAGC GATATTTGAC AGCCTTCATC AGCACAACAT TACCTTTGCG
CTGGATGACT TTGGTACGGG TTATGCGACC TATCGTTACT TGCAGGCGTT CCCGGTCGAT
TTTATTAAGA TCGATAAGTC ATTTGTGCAA ATGGCGAGTG TCGACGAAAT CTCCGGTCAT
ATTGTGGACA ATATTGTCGA ACTAGCGCGT AAGCCTGGTC TGAGTATCGT GGCGGAAGGG
GTAGAAACCC AGGAGCAGGC GGATTTAATG ATCGGTAAAG GCGTTCACTT TTTGCAGGGC
TATTTGTACT CTCCGCCAGT ACCGGGTAAT AAATTTATCT CTGAATGGGT AATGAAAGCA
GGTGGTTGA
 
Protein sequence
MNSCDFRVFL QEFGTTVHLS LPGSVSEKER LLLKLLMQGM SVTEISQYRN RSAKTISHQK 
KQLFEKLGIQ SDITFWRDIF FQYNPEIISA TGSNSHRYIN DNHYHHIVTP EAISLALENH
EFKPWIQPVF CAQTGVLTGC EVLVRWEHPQ TGIIPPDQFI PLAESSGLIV IMTRQLMKQT
ADILMPVKHL LPDNFHIGIN VSAGCFLAAG FEKECLNLVN KLGNDKIKLV LELTERNPIP
VTPEARAIFD SLHQHNITFA LDDFGTGYAT YRYLQAFPVD FIKIDKSFVQ MASVDEISGH
IVDNIVELAR KPGLSIVAEG VETQEQADLM IGKGVHFLQG YLYSPPVPGN KFISEWVMKA
GG