Gene EcolC_1532 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1532 
Symbol 
ID6065895 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1694729 
End bp1698523 
Gene Length3795 bp 
Protein Length1264 aa 
Translation table11 
GC content50% 
IMG OID641600949 
ProductWGR domain-containing protein 
Protein accessionYP_001724519 
Protein GI170019565 
COG category[S] Function unknown 
COG ID[COG3831] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACACT TTATCTATCA GGACGAAAAA TCACATAAAT TCTGGGCGGT GGAGCAACAG 
GGAAACGAGT TGCATATCAG TTGGGGAAAA GTTGGCACCA AAGGGCAAAG TCAGATAAAA
AGTTTTTCAG ATGCTGCGGC AGCGGAAAAA GCGGAACTTA AGCTGATTGC GGAGAAGGTG
AAGAAGGGGT ATGTGGAGCA AGCGAAGGAT AATTCTTTGC AACCTTCCCA AACGGTAACG
GGCTCTCTCA AGGTAGCGGA CTTATCCACC ATTATTCAGG AACAACCCTC TTTCGTAGCA
GAAACCCGTG CGCCTGACAA AAATACAGAT GCTGTTTTAC CGTGGCTGGC GAAAGATATT
GCTGTCGTTT TTCCGCCCGA AGTTGTACAC ACCACGTTAA GTCATCGCCG CTTTCCCGGA
GTTCCTGTTC AGCAAGCAGA CAAATTGACC CAATTACGTC GCTTAGCCTG TAGTGTGTCG
CAACGGGATA ATAAAACAGC CACATTTGAC TTCAGCGCCT GTTCTTTAGA ATGGCAAAAC
ACCGTCGCCC AGGCGATCAG TCAGATCGAC GGCCTGAAAA CAACACAGTT ACCATCACCA
GTAATGGCTG TACTCACGGC ACTTGAAATG AAATGCACAA GATATAAAGT GCGTGAGGAT
GTTATGGATC AGATCGTCCA GGAAGGCGGT CTGGAATATG CTACTGATGT AATAATACAC
CTTCAACAGA TTGATATTGA ATGGGATTAT GCGAATAATG TCATTATTAT TCTGCCGTCT
GGCATTGCAC CTAGCTACTT GGAGCAATAT TCCAGATTTG AATTACGCCT ACGTAAACAT
TTATCACTGA CGGAAGAGTC TCTCTGGCAA AAATGTGCAC AAAAACTTAT TGCCGCAATT
CCACATATTC CAGAATGGCG GCAACCATTA ATTGCTTTGT TATTACCCGA AAAACCAGAA
ATTGCACATG AAATTGCCCA GCGTCTACTG GGGCAAAAAA AATTACCCTC GCTTGAGTGG
TTAAAAATAG TGGCGACTGA TGAGCACATT CTTGCCTCAT TAGAAAAATA TCACGAACCA
TATGCCATTT TTGATGATTA CTATTGTGGT GCGATATGGT CAGCCACCGT ATTACAGGAG
CAAGGTGTTG CAGCCCTGCC CCGATTTGCT CCCTATGCCG CAAGTGACTA CTGCGCCGAT
GTGTTGCGTC ATATCAATCA TCCGTTCGCA TTGACACTGC TTATACGTGT AGCCGGGCAA
ACTAAACGCT GTCACGATCG GATGACGAAA GCCATTGCTG CGTTCCCACA TGCAGCAATG
GCGGCACTGA CGGAACTTCT TGGGCAAAAA GAAGAGAACA GTTGGCGCAT TATGCTAATG
ACAATGCTTA TCTCACAACC AGCACTGGCA GAACAGGTCA TTCCCTGGCT CTCGACACCC
GCAGTTGCCG TACTGAAATC ATGCCAGCAA CAACTGACAC AGCCCTCAAA CCATGCCAGC
GCCGATCTAC TGCCAGCCGT AGTAGTCTCC CCTCCCTGGC TTTCGAAAAA GAAAAAATCG
CCGATTCCGG TGCTGGATTT AGCGCCATTA GGCATTGAGC CAATCTGTTA TCTGACAGAA
GAAATCAGTA ATCAACTTTT GGCGAAATAT ATCTGGTATT CAAAACACAT CACGGTTAGC
CATGAAGAAA GTACTACCAA CCTGTTGGCA AGGATGGGTT TTCAACGACG GATCGCTGGT
ACATATATTA AAGCTCCCGA AGCGGTAGTT GAGGCATGGC TAAATGAAGA TTATTCAACC
TTACTAAGTG AATTTAAGGT GTTTCATTCA CCTACCGGGC ATTATTGGCA GTTGGGGATT
TTGACAACAT TGCCGCTGGA GAAAGCAGTA AAAGCATGGA ATGCCCTTAC CCTATCTCCA
CATACCGATA CCGAATACTC CATGTTACAT TTTGGACTCA AAGGGTTACC TGGGTTAGTA
AACTCACTTG CACGCTATCC ACAAGAAGCC TTGCCCATCA CGAATTACTT CGCAGCGAGT
GAGCTGGCTC CTGCCGTCGC CCGTGCCTTC AACAAACTGA AAACGCTACG CGAAAACGCC
CGTAGCTGGC TGTTGAAATA CCCGGAACAT GCCCTTACCG GCCTGCTGCC TGCGGCGCTC
GGCAAAGCCG GTGAAGCACA GGATAACGCC CGCGCTGCCT TGCGTATGCT TACCGAAAAC
GGTCATCAGC CATTACTGCA AGAAATCGCC CGACGTTATA ACCAGCCGGA AGTAACCGAT
GCGGTGAACG CTCTGCTTGC GCTCGATCCC TTAGATAATC ACCCGACAAA AATCCCCACT
CTTCCGGCCT TTTATCAGCC ATCGCTCTGG ACGCGCCCGG TATTAAAAGC AAATGCCCAA
TCACTGCCAG ATAGCGCCCT CCTCCACCTC GGTGAAATGC TCCGCTTCCC TCAGGAAGAG
GCTCTGTATC CGGGATTATT GCAGGTGAAA GACGTCTGTT CCGCCGACTC ACTGGCGGGA
TTTGCCTGGG ATCTGTTTAC CGCCTGGCAG ACCGCTGGCG CGCCGTCGAA AGAGAGTTGG
GCGTTCACTG CGTTAGGCGT TCTCGGTAAC GATGACACCG CCCGCAAACT GACGCCATTA
ATACGCGCCT GGCCTGGTGA ATCCCAGCAT AAACGCGCCA CCGTTGGGTT GGATATTCTC
GCTGCTATCG GTAGTGATAT CGCCCTTATG CAGCTTAACG GCATCGCCCA GAAACTGAAA
TTCAAAGCAT TACAGGAGCG GGCAAAAGAA AAAATTGCCG ACATTGCCGA GAGCCGCGAA
CTCACGGTGG CGGAGCTTGA AGATCGGTTA GCACCGGATC TCGGTCTGGA TGATAACGGT
TCGCTGCTGC TGGATTTTGG CCCACGGCAG TTCACCGTCA GCTTTGATGA AACCTTAAAA
CCGTTTGTGC GTGATGTTTC CGGCAGCCGC CTGAAAGACT TGCCCAAACC GAACAAAAGC
GATGATGAAA CGCGGGCGAA CGATGCGGTT AACCGCTACA AATTGCTGAA AAAAGATGCG
CGTACCATCG CCGCCCAGCA GGTAGCAAGG CTGGAATCCG CCATGTGCCT GCGCCGCCGC
TGGTCGCTGG AAAACTTCCA GCTCTTCCTG GTTGAGCATC CGCTGGTTCG TCACTTAACC
CGCCGTCTGA TTTGGGGCGT TTATAGCGCC GAAAACCAGC TACTGGCTTG CTTTCGCGTA
GCAGAAGATA ACAGCTCCAG CACCGCTGAC GATGATCTTT TCACCCTGCC GGAAGGCGAT
ATCTCTATCG GCACTCCTCA CGTTCTGGAA ATATCACCAA CGGATGCTGC CGCCTTTGGT
CAGCTTTTTG CCGACTACGA ACTGCTACCA CCGTTCCGCC AGCTCGACCG TAACAGCTAC
GCCCTGACAG AAGCCGAGCG CAATGCCAGT GAACTGACCC GCTGGGCAGG CAGAAAATGC
CCAAGTGGTC GGGTCATGGG GCTGGCGAAT AAAGGCTGGA TAAAGGGCGA ACCACAGGAT
GGAGGCTGGA TCGGATGGAT GATCAAACCT TTGGGTCGCT GGTCGTTAAT CATGGAAATC
GATGAAGGCT TTGCGGTAGG CATGTCGCCA GCCGAACTCA GCGCTGAGCA GCTCTTAAGC
AAGCTGTGGC TATGGGAAGG CAAAGCAGAA AGATATGGCT GGGGGAGTAA TTCAACACAG
GAAGCGCAGT TCTCCGTAAT CGATGCCATC ACCGCCAGCG AGCTAATTAA CGATATTGAA
GCCCTGTTTG AATAA
 
Protein sequence
MRHFIYQDEK SHKFWAVEQQ GNELHISWGK VGTKGQSQIK SFSDAAAAEK AELKLIAEKV 
KKGYVEQAKD NSLQPSQTVT GSLKVADLST IIQEQPSFVA ETRAPDKNTD AVLPWLAKDI
AVVFPPEVVH TTLSHRRFPG VPVQQADKLT QLRRLACSVS QRDNKTATFD FSACSLEWQN
TVAQAISQID GLKTTQLPSP VMAVLTALEM KCTRYKVRED VMDQIVQEGG LEYATDVIIH
LQQIDIEWDY ANNVIIILPS GIAPSYLEQY SRFELRLRKH LSLTEESLWQ KCAQKLIAAI
PHIPEWRQPL IALLLPEKPE IAHEIAQRLL GQKKLPSLEW LKIVATDEHI LASLEKYHEP
YAIFDDYYCG AIWSATVLQE QGVAALPRFA PYAASDYCAD VLRHINHPFA LTLLIRVAGQ
TKRCHDRMTK AIAAFPHAAM AALTELLGQK EENSWRIMLM TMLISQPALA EQVIPWLSTP
AVAVLKSCQQ QLTQPSNHAS ADLLPAVVVS PPWLSKKKKS PIPVLDLAPL GIEPICYLTE
EISNQLLAKY IWYSKHITVS HEESTTNLLA RMGFQRRIAG TYIKAPEAVV EAWLNEDYST
LLSEFKVFHS PTGHYWQLGI LTTLPLEKAV KAWNALTLSP HTDTEYSMLH FGLKGLPGLV
NSLARYPQEA LPITNYFAAS ELAPAVARAF NKLKTLRENA RSWLLKYPEH ALTGLLPAAL
GKAGEAQDNA RAALRMLTEN GHQPLLQEIA RRYNQPEVTD AVNALLALDP LDNHPTKIPT
LPAFYQPSLW TRPVLKANAQ SLPDSALLHL GEMLRFPQEE ALYPGLLQVK DVCSADSLAG
FAWDLFTAWQ TAGAPSKESW AFTALGVLGN DDTARKLTPL IRAWPGESQH KRATVGLDIL
AAIGSDIALM QLNGIAQKLK FKALQERAKE KIADIAESRE LTVAELEDRL APDLGLDDNG
SLLLDFGPRQ FTVSFDETLK PFVRDVSGSR LKDLPKPNKS DDETRANDAV NRYKLLKKDA
RTIAAQQVAR LESAMCLRRR WSLENFQLFL VEHPLVRHLT RRLIWGVYSA ENQLLACFRV
AEDNSSSTAD DDLFTLPEGD ISIGTPHVLE ISPTDAAAFG QLFADYELLP PFRQLDRNSY
ALTEAERNAS ELTRWAGRKC PSGRVMGLAN KGWIKGEPQD GGWIGWMIKP LGRWSLIMEI
DEGFAVGMSP AELSAEQLLS KLWLWEGKAE RYGWGSNSTQ EAQFSVIDAI TASELINDIE
ALFE