Gene ECD_02175 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_02175 
SymbolyfaY 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp2252605 
End bp2253807 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content54% 
IMG OID 
Productcompetence damage-inducible protein A 
Protein accessionACT43998 
Protein GI253978328 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.478286 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAAAAG TGGAAATGTT ATCCACCGGG GATGAAGTGT TACACGGGCA AATCGTTGAC 
ACTAACGCTG CCTGGCTGGC CGATTTTTTC TTTCATCAGG GGTTGCCATT ATCTCGCCGC
AATACGGTGG GGGATAACCT TGATGACTTA GTCACCATTC TTCGCGAACG TAGTCAGCAC
GCCGATGTGC TGATCGTTAA CGGCGGGCTG GGACCGACCA GCGATGATTT AAGCGCACTC
GCCGCTGCGA CAGCAAAAGG TGAAGGCCTG GTGCTGCATG AAGCCTGGCT CAAAGAGATG
GAACGCTATT TCCACGAACG TGGACGAGTA ATGGCACCGA GCAACCGTAA ACAAGCGGAG
CTGCCTGCCA GTGCTGAATT TATCAATAAC CCGGTAGGCA CCGCCTGTGG TTTTGCCGTG
CAGCTTAATC GTTGCCTGAT GTTCTTTACT CCCGGCGTAC CGTCAGAATT TAAGGTGATG
GTCGAGCACG AAATCCTGCC GCGCCTGCGC GAGCGTTTTT CTTTACCGCA GCCGCCGGTT
TGTCTGCGTT TGACTACTTT TGGTCGTTCG GAAAGCGATC TGGCACAAAG CCTGGACACT
CTACAACTGC CGCCGGGCGT AACAATGGGC TATCGCTCCT CAATGCCTAT CATCGAACTG
AAACTCACCG GACCGGCAAG CGAGCAACAG GCGATGGAAA AACTGTGGCT GGATGTTAAA
CGTGTTGCCG GACAGAGCGT GATTTTCGAA GGCACTGAAG GACTGCCCGC GCAGATCAGT
CGCGAATTGC AAAACCGCCA GTTCAGCCTG ACGTTGAGCG AGCAATTCAC CGGTGGTTTA
TTGGCTTTGC AACTTTCTCG CGCAGGTGCT CCATTGCTGG CGTGTGAAGT GGTTCCTTCA
CAGGAGGAAA CCCTGGCGCA AACTGCGCAC TGGATTACAG AACGGCGGGC CAACCATTTT
GCCGGGCTGG CACTGGCTGT TTCGGGTTTC GAGAACGAGC ATCTCAACTT TGCGCTAGCC
ACGCCAGACG GCACTTTCGC TCTGCGTGTG CGTTTCAGCA CTACGCGCTA CAGCCTGGCT
ATCCGTCAGG AAGTGTGCGC AATGATGGCA CTGAATATGC TGCGCCGTTG GTTAAACGGC
CAGGATATCG CCAGTGAGCA TGGCTGGATT GAGGTTGTTG AGTCCATGAC CTTATCTGTC
TGA
 
Protein sequence
MLKVEMLSTG DEVLHGQIVD TNAAWLADFF FHQGLPLSRR NTVGDNLDDL VTILRERSQH 
ADVLIVNGGL GPTSDDLSAL AAATAKGEGL VLHEAWLKEM ERYFHERGRV MAPSNRKQAE
LPASAEFINN PVGTACGFAV QLNRCLMFFT PGVPSEFKVM VEHEILPRLR ERFSLPQPPV
CLRLTTFGRS ESDLAQSLDT LQLPPGVTMG YRSSMPIIEL KLTGPASEQQ AMEKLWLDVK
RVAGQSVIFE GTEGLPAQIS RELQNRQFSL TLSEQFTGGL LALQLSRAGA PLLACEVVPS
QEETLAQTAH WITERRANHF AGLALAVSGF ENEHLNFALA TPDGTFALRV RFSTTRYSLA
IRQEVCAMMA LNMLRRWLNG QDIASEHGWI EVVESMTLSV