Gene ECD_00271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_00271 
SymbolyahA 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp301999 
End bp303087 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content46% 
IMG OID 
Productpredicted DNA-binding transcriptional regulator 
Protein accessionACT42170 
Protein GI253976500 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.809643 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTCAT GTGATTTTCG TGTTTTTCTG CAAGAGTTCG GTACAACGGT TCATTTGTCA 
TTGCCTGGTA GCGTATCCGA GAAAGAACGA CTGCTACTCA AGCTGCTGAT GCAGGGAATG
TCTGTAACAG AAATATCACA GTACAGAAAT CGCAGTGCAA AGACAATTTC ACATCAAAAG
AAACAGCTCT TTGAGAAACT GGGGATTCAG AGCGATATTA CTTTCTGGCG CGATATTTTC
TTTCAGTACA ATCCGGAGAT CATATCCGCC ACGGGGAGTA ATAGTCACAG ATATATTAAT
GATAATCACT ATCACCATAT CGTCACGCCT GAAGCCATCA GTCTGGCGTT GGAAAACCAC
GAATTCAAAC CGTGGATCCA ACCGGTTTTC TGCGCGCAGA CTGGGGTACT GACGGGCTGT
GAGGTGCTTG TCCGCTGGGA ACATCCACAA ACGGGAATTA TCCCACCGGA TCAGTTTATT
CCTCTGGCGG AGTCATCCGG TCTTATTGTC ATAATGACCC GCCAACTGAT GAAACAGACT
GCGGATATTC TGATGCCGGT AAAACATTTG CTGCCGGACA ATTTCCATAT TGGCATCAAC
GTCTCGGCGG GTTGTTTTTT GGCAGCGGGA TTTGAAAAAG AGTGTCTGAA CCTGGTTAAT
AAATTAGGTA ACGATAAAAT CAAGCTGGTT CTCGAGCTAA CGGAACGTAA CCCTATTCCG
GTAACGCCAG AAGCCAGAGC GATATTTGAC AGCCTTCATC AGCACAACAT TACCTTTGCG
CTGGATGACT TTGGTACGGG TTATGCGACC TATCGTTACT TGCAGGCGTT CCCGGTCGAT
TTTATTAAGA TCGATAAGTC ATTTGTGCAA ATGGCGAGTG TCGACGAAAT CTCCGGTCAT
ATTGTGGACA ATATTGTCGA ACTAGCGCGT AAGCCTGGTC TGAGTATCGT GGCGGAAGGG
GTAGAAACCC AGGAGCAGGC GGATTTAATG ATCGGTAAAG GCGTTCACTT TTTGCAGGGC
TATTTGTACT CTCCGCCAGT ACCGGGTAAT AAATTTATCT CTGAATGGGT AATGAAAGCA
GGTGGTTGA
 
Protein sequence
MNSCDFRVFL QEFGTTVHLS LPGSVSEKER LLLKLLMQGM SVTEISQYRN RSAKTISHQK 
KQLFEKLGIQ SDITFWRDIF FQYNPEIISA TGSNSHRYIN DNHYHHIVTP EAISLALENH
EFKPWIQPVF CAQTGVLTGC EVLVRWEHPQ TGIIPPDQFI PLAESSGLIV IMTRQLMKQT
ADILMPVKHL LPDNFHIGIN VSAGCFLAAG FEKECLNLVN KLGNDKIKLV LELTERNPIP
VTPEARAIFD SLHQHNITFA LDDFGTGYAT YRYLQAFPVD FIKIDKSFVQ MASVDEISGH
IVDNIVELAR KPGLSIVAEG VETQEQADLM IGKGVHFLQG YLYSPPVPGN KFISEWVMKA
GG