Gene ECD_02581 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_02581 
SymbolfhlA 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp2694305 
End bp2696383 
Gene Length2079 bp 
Protein Length692 aa 
Translation table11 
GC content53% 
IMG OID 
ProductDNA-binding transcriptional activator 
Protein accessionACT44400 
Protein GI253978730 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCATATA CACCGATGAG TGATCTCGGA CAACAAGGGT TGTTCGACAT CACTCGGACA 
CTATTGCAGC AGCCCGATCT GGCCTCGCTG TGTGAGGCTC TTTCGCAACT GGTAAAGCGT
TCTGCGCTCG CCGACAACGC GGCTATTGTG TTGTGGCAAG CGCAGACTCA ACGTGCGTCT
TATTACGCGT CGCGTGAAAA AGACACCCCC ATTAAATATG AAGACGAAAC TGTTCTGGCA
CACGGTCCGG TACGCAGCAT TTTGTCGCGC CCTGATACGC TGCATTGCAG TTACGAAGAA
TTTTGTGAAA CCTGGCCGCA GCTGGACGCA GGTGGGCTAT ACCCAAAATT TGGTCACTAT
TGCCTGATGC CACTGGCGGC GGAAGGGCAT ATTTTTGGTG GCTGTGAATT TATTCGTTAT
GACGATCGCC CCTGGAGCGA AAAAGAGTTC AATCGTCTGC AAACATTTAC GCAGATCGTT
TCTGTCGTCA CCGAACAAAT CCAGAGCCGC GTCGTTAACA ATGTCGACTA TGAGTTGTTA
TGCCGGGAAC GCGATAACTT CCGCATCCTG GTCGCCATCA CCAACGCGGT GCTTTCCCGC
CTGGATATGG ACGAACTGGT CAGCGAAGTC GCCAAAGAAA TCCATTACTA TTTCGACATT
GACGATATCA GTATCGTCTT ACGCAGCCAC CGTAAAAACA AACTCAACAT CTACTCCACT
CACTATCTTG ATAAACAGCA TCCCGCCCAC GAACAGAGCG AAGTCGATGA AGCCGGAACC
CTCACCGAAC GCGTGTTCAA AAGTAAAGAG ATGCTGCTGA TCAATCTCCA CGAGCGGGAC
GATTTAGCCC CCTATGAACG CATGTTGTTC GACACCTGGG GCAACCAGAT TCAAACCTTG
TGCCTGTTAC CGCTGATGTC TGGCGACACC ATGCTGGGCG TGCTGAAACT GGCGCAATGC
GAAGAGAAAG TGTTTACCAC TACCAATCTG AATTTACTGC GCCAGATTGC CGAACGTGTG
GCAATCGCTG TCGATAACGC CCTCGCCTAT CAGGAAATCC ATCGTCTGAA AGAACGGCTG
GTTGATGAAA ACCTCGCCCT GACCGAGCAG CTCAACAATG TTGATAGTGA ATTTGGCGAG
ATTATTGGCC GCAGCGAAGC CATGTACAGC GTGCTTAAAC AAGTTGAAAT GGTGGCGCAA
AGTGACAGTA CCGTGCTGAT CCTCGGTGAA ACTGGCACGG GTAAAGAGCT GATTGCCCGT
GCGATCCATA ATCTCAGTGG GCGTAATAAT CGCCGCATGG TCAAAATGAA CTGCGCGGCG
ATGCCTGCCG GATTGCTGGA AAGCGATCTG TTTGGTCATG AGCGTGGGGC TTTTACCGGT
GCCAGCGCCC AGCGTATCGG TCGTTTTGAA CTGGCGGATA AAAGCTCCCT GTTCCTCGAC
GAAGTGGGCG ATATGCCACT GGAGTTACAG CCGAAGTTGC TGCGTGTATT GCAGGAACAG
GAGTTTGAAC GTCTCGGCAG CAACAAAATC ATTCAGACGG ACGTGCGTCT AATCGCCGCG
ACTAACCGCG ATCTGAAAAA AATGGTCGCC GACCGTGAGT TCCGTAGCGA TCTCTATTAC
CGCCTGAACG TATTCCCGAT TCACCTGCCG CCACTACGCG AGCGTCCGGA AGATATTCCG
CTGCTGGCGA AAGCCTTTAC CTTCAAAATT GCCCGTCGTC TGGGGCGCAA TATCGACAGC
ATTCCTGCCG AGACGCTGCG CACCTTGAGC AACATGGAGT GGCCGGGTAA CGTACGCGAA
CTGGAAAACG TCATTGAGCG CGCGGTATTG CTAACACGCG GTAACGTGCT GCAGCTGTCA
TTGCCAGATA TTGTTTTACC GGAACCTGAA ACGCCGCCTG CCGCAACGGT TGTCGCCCTG
GAGGGCGAAG ATGAATATCA GTTGATTGTG CGCGTGCTAA AAGAAACCAA CGGCGTGGTT
GCCGGGCCTA AAGGCGCTGC GCAACGTCTG GGGCTGAAAC GCACGACCCT GCTGTCACGG
ATGAAGCGGC TGGGAATTGA TAAATCGGCA TTGATTTAA
 
Protein sequence
MSYTPMSDLG QQGLFDITRT LLQQPDLASL CEALSQLVKR SALADNAAIV LWQAQTQRAS 
YYASREKDTP IKYEDETVLA HGPVRSILSR PDTLHCSYEE FCETWPQLDA GGLYPKFGHY
CLMPLAAEGH IFGGCEFIRY DDRPWSEKEF NRLQTFTQIV SVVTEQIQSR VVNNVDYELL
CRERDNFRIL VAITNAVLSR LDMDELVSEV AKEIHYYFDI DDISIVLRSH RKNKLNIYST
HYLDKQHPAH EQSEVDEAGT LTERVFKSKE MLLINLHERD DLAPYERMLF DTWGNQIQTL
CLLPLMSGDT MLGVLKLAQC EEKVFTTTNL NLLRQIAERV AIAVDNALAY QEIHRLKERL
VDENLALTEQ LNNVDSEFGE IIGRSEAMYS VLKQVEMVAQ SDSTVLILGE TGTGKELIAR
AIHNLSGRNN RRMVKMNCAA MPAGLLESDL FGHERGAFTG ASAQRIGRFE LADKSSLFLD
EVGDMPLELQ PKLLRVLQEQ EFERLGSNKI IQTDVRLIAA TNRDLKKMVA DREFRSDLYY
RLNVFPIHLP PLRERPEDIP LLAKAFTFKI ARRLGRNIDS IPAETLRTLS NMEWPGNVRE
LENVIERAVL LTRGNVLQLS LPDIVLPEPE TPPAATVVAL EGEDEYQLIV RVLKETNGVV
AGPKGAAQRL GLKRTTLLSR MKRLGIDKSA LI