Gene ECD_03075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_03075 
SymbolarcB 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp3219189 
End bp3221525 
Gene Length2337 bp 
Protein Length778 aa 
Translation table11 
GC content50% 
IMG OID 
Producthybrid sensory histidine kinase in two-component regulatory system with ArcA 
Protein accessionACT44879 
Protein GI253979209 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCAAA TTCGTCTGCT GGCGCAGTAT TATGTTGACC TGATGATGAA GTTAGGTCTG 
GTGCGCTTCT CAATGTTGCT GGCGCTGGCC CTCGTCGTTC TTGCCATTGT GGTACAAATG
GCGGTAACCA TGGTGCTGCA TGGTCAGGTC GAAAGCATTG ATGTTATTCG TTCTATCTTC
TTTGGTTTGC TGATTACGCC GTGGGCGGTC TACTTTCTAT CGGTGGTCGT CGAGCAACTG
GAGGAGTCAC GACAACGTCT GTCACGGCTG GTGCAAAAAC TGGAGGAGAT GCGCGAGCGC
GATTTGAGCC TCAACGTTCA GTTAAAAGAT AATATTGCCC AGCTAAATCA GGAAATTGCC
GTTCGTGAAA AAGCGGAAGC AGAACTGCAG GAAACCTTCG GCCAACTGAA AATTGAAATC
AAAGAGCGCG AAGAGACACA AATTCAGCTC GAGCAGCAAT CCTCATTCTT ACGTTCCTTC
CTTGATGCTT CACCCGACCT GGTTTTTTAT CGTAATGAAG ATAAAGAGTT TTCCGGCTGT
AACCGCGCGA TGGAGCTGCT GACCGGAAAA AGCGAAAAAC AACTGGTTCA TCTGAAACCT
GCTGATGTTT ACTCACCGGA AGCCGCTGCG AAAGTCATTG AAACCGATGA AAAAGTGTTC
CGTCATAATG TGTCACTGAC CTATGAGCAG TGGCTGGATT ATCCCGACGG ACGCAAAGCC
TGCTTTGAAA TTCGTAAAGT GCCGTACTAC GACCGCGTGG GTAAACGTCA CGGTTTGATG
GGCTTTGGTC GCGACATTAC CGAGCGTAAG CGGTATCAGG ATGCGCTTGA ACGCGCCAGC
CGCGACAAAA CGACGTTTAT TTCCACCATC AGTCACGAAT TGCGTACACC GCTGAACGGT
ATCGTCGGCC TGAGCCGCAT CCTACTGGAT ACCGAACTCA CCGCCGAGCA GGAAAAATAT
CTCAAAACTA TCCATGTTTC GGCCGTCACG CTGGGGAATA TCTTCAACGA TATTATCGAC
ATGGATAAGA TGGAACGGCG CAAGGTCCAG CTTGATAATC AGCCGGTTGA TTTCACCAGC
TTCCTTGCCG ATCTGGAAAA TCTCTCCGCC TTGCAGGCGC AACAAAAAGG ATTGCGCTTT
AACCTGGAGC CTACGCTGCC ATTACCGCAT CAGGTCATTA CCGACGGGAC GCGTTTACGG
CAGATCCTGT GGAACCTCAT CAGTAACGCC GTCAAATTCA CCCAGCAAGG CCAGGTTACC
GTGCGCGTGC GCTACGATGA AGGCGATATG CTGCATTTTG AAGTGGAAGA TTCTGGTATC
GGCATTCCGC AGGATGAGCT GGATAAAATC TTCGCCATGT ATTACCAGGT GAAAGACAGT
CATGGCGGTA AACCTGCCAC CGGCACCGGT ATTGGTCTGG CCGTTTCTCG TCGTCTGGCG
AAAAATATGG GCGGCGATAT TACGGTTACC AGCGAACAGG GCAAAGGTTC AACCTTTACG
TTGACGATCC ACGCACCGTC GGTGGCAGAA GAGGTCGATG ATGCGTTTGA TGAAGACGAT
ATGCCTTTAC CGGCGCTGAA TGTGCTGCTG GTGGAAGACA TTGAACTGAA CGTGATTGTC
GCGCGTTCTG TGCTGGAAAA ATTAGGTAAC AGCGTTGATG TCGCCATGAC CGGCAAGGCG
GCGCTGGAGA TGTTTAAACC GGGCGAATAC GACCTGGTGT TGCTGGATAT TCAGTTGCCA
GATATGACCG GGCTGGATAT CTCTCGTGAA CTGACGAAGC GTTATCCGCG CGAGGATTTA
CCGCCGCTGG TGGCCTTAAC CGCTAACGTG CTGAAAGACA AACAAGAGTA CCTCAATGCT
GGAATGGATG ATGTGCTGAG TAAGCCGCTT TCTGTTCCGG CGCTAACCGC GATGATCAAG
AAATTCTGGG ATACCCAGGA TGATGAGGAG AGTACGGTGA CGACAGAAGA GAACAGTAAA
TCAGAAGCAT TGCTCGATAT TCCCATGCTG GAACAGTATC TCGAACTTGT AGGACCGAAG
CTGATCACCG ACGGGTTAGC GGTGTTTGAG AAGATGATGC CAGGCTATGT CAGCGTGCTG
GAGTCGAATC TGACGGCGCA GGATAAAAAA GGCATTGTTG AGGAAGGACA TAAAATTAAA
GGTGCGGCGG GGTCAGTGGG GTTACGCCAT CTGCAACAGC TGGGTCAGCA AATTCAGTCT
CCTGACCTTC CTGCCTGGGA AGATAACGTC GGTGAATGGA TTGAAGAGAT GAAAGAAGAG
TGGCGTCACG ACGTAGAAGT GCTGAAAGCG TGGGTGGCAA AAGCCACTAA AAAATGA
 
Protein sequence
MKQIRLLAQY YVDLMMKLGL VRFSMLLALA LVVLAIVVQM AVTMVLHGQV ESIDVIRSIF 
FGLLITPWAV YFLSVVVEQL EESRQRLSRL VQKLEEMRER DLSLNVQLKD NIAQLNQEIA
VREKAEAELQ ETFGQLKIEI KEREETQIQL EQQSSFLRSF LDASPDLVFY RNEDKEFSGC
NRAMELLTGK SEKQLVHLKP ADVYSPEAAA KVIETDEKVF RHNVSLTYEQ WLDYPDGRKA
CFEIRKVPYY DRVGKRHGLM GFGRDITERK RYQDALERAS RDKTTFISTI SHELRTPLNG
IVGLSRILLD TELTAEQEKY LKTIHVSAVT LGNIFNDIID MDKMERRKVQ LDNQPVDFTS
FLADLENLSA LQAQQKGLRF NLEPTLPLPH QVITDGTRLR QILWNLISNA VKFTQQGQVT
VRVRYDEGDM LHFEVEDSGI GIPQDELDKI FAMYYQVKDS HGGKPATGTG IGLAVSRRLA
KNMGGDITVT SEQGKGSTFT LTIHAPSVAE EVDDAFDEDD MPLPALNVLL VEDIELNVIV
ARSVLEKLGN SVDVAMTGKA ALEMFKPGEY DLVLLDIQLP DMTGLDISRE LTKRYPREDL
PPLVALTANV LKDKQEYLNA GMDDVLSKPL SVPALTAMIK KFWDTQDDEE STVTTEENSK
SEALLDIPML EQYLELVGPK LITDGLAVFE KMMPGYVSVL ESNLTAQDKK GIVEEGHKIK
GAAGSVGLRH LQQLGQQIQS PDLPAWEDNV GEWIEEMKEE WRHDVEVLKA WVAKATKK