Gene ECD_02702 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_02702 
SymbolygeV 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp2834870 
End bp2836648 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content46% 
IMG OID 
Productpredicted DNA-binding transcriptional regulator 
Protein accessionACT44519 
Protein GI253978849 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGCTTG CTACTACGCA GTCAGTATTG ATGCAAATTC AACCGACAAT TCAGCGTTTT 
GCCAGAATGC TTGCCAGCGT TTTGCAGCTT GAGGTTGAGA TCGTTGATGA AAACTTGTGT
CGCGTTGCCG GAACGGGCGC GTATGGGAAG TTTCTTGGTC GCCAGTTGAG CGGCAACTCA
CGCCTGCTCC GCCACGTCCT GGAAACGAAA ACTGAAAAAG TTGTGACACA GTCTCGCTTC
GATCCCCTTT GCGAAGGTTG CGATAGTAAA GAAAATTGCC GCGAAAAAGC ATTTCTGGGT
ACGCCTGTCA TTTTACAGGA TCGTTGTGTT GGGGTGATAA GTTTGATTGC CGTTACCCAC
GAGCAACAAG AGCATATCAG TGATAATTTA CGCGAATTTT CTGATTATGT TCGCCATATA
TCCACCATTT TTGTTTCGAA ACTTCTGGAG GATCAGGGGC CAGGAGATAA CATCAGTAAA
ATATTCGCGA CCATGATCGA TAATATGGAT CAGGGCGTAT TAGTTGTTGA TGATGAAAGT
CGGGTTCAGT TTGTTAATCA GACTGCCTTA AAAACACTTG GTGTTGTACA AAATAATATT
ATTGGGAAAC CTATCCGTTT CAGACCATTA ACATTTGAGA GTAATTTTAC TCATGGACAT
ATGCAGCATA TTGTTTCGTG GGACGATAAA AGTGAATTAA TCATTGGTCA ATTGCATAAC
ATTCAGGGCC GACAATTATT TTTAATGGCA TTTCACCAAT CGCATACCAG TTTTTCTGTA
GCAAATGCAC CTGATGAACC ACATATTGAA CAATTGGTTG GCGAGTGCCG TGTTATGCGG
CAATTAAAAC GACTCATTAG CCGTATTGCA CCCAGCCCAT CCAGCGTTAT GGTGGTTGGT
GAAAGCGGCA CGGGTAAAGA AGTCGTCGCC CGAGCAATCC ATAAGTTGAG CGGAAGACGG
AATAAACCCT TTATTGCTAT CAACTGTGCC GCGATTCCGG AGCAGCTTCT GGAAAGCGAA
CTGTTCGGTT ATGTTAAAGG CGCATTTACT GGCGCTTCTG CCAACGGTAA AACAGGGTTG
ATTCAGGCGG CGAATACGGG CACGCTGTTT CTCGATGAAA TAGGTGATAT GCCATTAATG
TTGCAGGCTA AATTACTGCG CGCTATTGAG GCGCGTGAAA TTCTGCCGAT TGGTGCCAGT
AGCCCAATAC AAGTCGACAT TCGCATCATT TCTGCAACTA ATCAGAATTT GGCCCAGTTC
ATTGCCGAAG GTAAATTCCG CGAAGATCTC TTCTACCGAC TTAATGTTAT CCCGATAACT
CTGCCACCGC TGCGTGAACG TCAGGAAGAT ATTGAACTAT TGGTGCATTA CTTTTTACAT
CTGCATACCC GTCGTCTGGG ATCGGTTTAT CCTGGCATTG CTCCCGATGT CGTCGAAATA
TTGCGTAAGC ATCGTTGGCC CGGAAACCTG CGCGAGTTAA GCAATTTGAT GGAATATCTG
GTTAACGTGG TTCCTTCAGG TGAAGTTATC GACAGCACGC TATTGCCGCC AAATCTGCTG
AATAATGGCA CAACGGAGCA AAGTGATGTA ACAGAGGTCA GTGAGGCGCA CCTGTCACTC
GATGATGCGG GCGGCACGGC GCTGGAGGAG ATGGAAAAGC AAATGATCCG CGAGGCGCTT
TCACGTCATA ACAGCAAGAA GCTAGTTGCT GATGAACTGG GCATCGGCAT TGCTACGCTC
TATCGCAAGA TTAAGAAATA TGAGTTGTTA AACACATAA
 
Protein sequence
MELATTQSVL MQIQPTIQRF ARMLASVLQL EVEIVDENLC RVAGTGAYGK FLGRQLSGNS 
RLLRHVLETK TEKVVTQSRF DPLCEGCDSK ENCREKAFLG TPVILQDRCV GVISLIAVTH
EQQEHISDNL REFSDYVRHI STIFVSKLLE DQGPGDNISK IFATMIDNMD QGVLVVDDES
RVQFVNQTAL KTLGVVQNNI IGKPIRFRPL TFESNFTHGH MQHIVSWDDK SELIIGQLHN
IQGRQLFLMA FHQSHTSFSV ANAPDEPHIE QLVGECRVMR QLKRLISRIA PSPSSVMVVG
ESGTGKEVVA RAIHKLSGRR NKPFIAINCA AIPEQLLESE LFGYVKGAFT GASANGKTGL
IQAANTGTLF LDEIGDMPLM LQAKLLRAIE AREILPIGAS SPIQVDIRII SATNQNLAQF
IAEGKFREDL FYRLNVIPIT LPPLRERQED IELLVHYFLH LHTRRLGSVY PGIAPDVVEI
LRKHRWPGNL RELSNLMEYL VNVVPSGEVI DSTLLPPNLL NNGTTEQSDV TEVSEAHLSL
DDAGGTALEE MEKQMIREAL SRHNSKKLVA DELGIGIATL YRKIKKYELL NT