Gene EcHS_A4513 details

Gene Information

Locus tag: EcHS_A4513
Symbol:
ID: 5593018
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 4515915
End bp: 4517111
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 37%
IMG OID: 640923609
Product: hypothetical protein
Protein accession: YP_001461050
Protein GI: 157163732
COG category: [S] Function unknown
COG ID: [COG4269] Predicted membrane protein
TIGRFAM ID:
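
The start, end, gene length, and protein length fields above are related by simple arithmetic. A minimal sketch of that check follows; it assumes 1-based inclusive coordinates, an unspliced CDS, and a single untranslated stop codon. The function name `cds_lengths` is hypothetical and not part of the record.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the CDS ends
    with exactly one stop codon that is not translated.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_length, gene_length // 3 - 1


# Coordinates copied from the record above.
gene_bp, protein_aa = cds_lengths(4515915, 4517111)
print(gene_bp, protein_aa)  # 1197 398
```

Both values agree with the Gene Length and Protein Length fields of the record.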


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.00333203
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTCAAG TTATTAATGA AATGGATGTT CCGTCCCATT CGTTTGTTTT TCATGGTACA 
GGTGAGAGAT ATTTTCTTAT TTGTGTGGTG AATGTGTTGT TAACGATTAT AACGCTAGGT
ATCTATTTGC CATGGGCATT AATGAAATGT AAGCGTTATC TCTATGCTAA TATGGAAGTT
AACGGACAAC GATTTTCTTA TGGAATTACT GGTGGGAATG TTTTTGTTAG TTGTCTTGTT
TTTGTTTTTT GCTATTTCGC AATCTTAATG ACAGTGTCAG CAGATATGCC ACTTGTTGGC
TGTGTTTTGA CTTTGTTACT GTTGGTTTTG CTTATATTTA TGGCAGCAAA AGGACTGCGT
TATCAGGCCT TGATGACCAG TCTCAACGGC GTAAGATTTA GTTTTAATTG CTCTATGAAA
GGGTTCTGGT GGGTAACCTT TTTCTTGCCG ATTTTAATGG CCATTGGGAT GGGGACTGTT
TTCTTTATCT CGACAAAGAT GCTACATGCC AATAGTTCAA GTAGTGTTAT TATATCTGTG
GTTCTGATGG CAATAGTTGG TATTGTTTCC ATTGGTATTT TTAATGGTAC TTTATATAGC
CTGGTAATGA GTTTTCTCTG GAGCAATACC AGTTTCGGTA TACATCGTTT CAAGGTGAAA
TTAGATACTA CGTATTGTAT AAAATATGCC ATTCTCGCAT TTTTAGCTTT ATTACCTTTT
CTCGCTGTTG CTGGTTATAT TATCTTCGAT CAAATATTAA ATGCATATGA TAGTTCTGTG
TATGCAAATG ATGATATTGA GAATTTACAG CAATTTATGG AAATGCAACG TAAAATGATA
ATCGCGCAGT TAATCTATTA TTTTGGGATT GCTGTTAGCA CCAGTTATTT AACGGTGTCG
TTGCGAAATC ATTTTATGAG CAACCTGTCA CTGAATGATG GGCGTATTCG TTTTCGCTCA
ACTTTAACGT ACCACGGTAT GCTTTATCGC ATGTGTGCGT TGGTGGTGAT ATCCGGGATT
ACGGGCGGTC TGGCTTATCC ACTGCTGAAA TTATGGATGA TTGACTGGCA GGCAAAAAAT
ACGTATTTGC TGGGCGATTT GGATGACCTT CCTTTAATCA ATAAAGAAGA ACAACCAGAT
AAAGGCTTCT TAGCCAGGAT TTCACGGGGA ATTATGCCTT CTTTACCATT TCTGTAA
 
Protein sequence
MAQVINEMDV PSHSFVFHGT GERYFLICVV NVLLTIITLG IYLPWALMKC KRYLYANMEV 
NGQRFSYGIT GGNVFVSCLV FVFCYFAILM TVSADMPLVG CVLTLLLLVL LIFMAAKGLR
YQALMTSLNG VRFSFNCSMK GFWWVTFFLP ILMAIGMGTV FFISTKMLHA NSSSSVIISV
VLMAIVGIVS IGIFNGTLYS LVMSFLWSNT SFGIHRFKVK LDTTYCIKYA ILAFLALLPF
LAVAGYIIFD QILNAYDSSV YANDDIENLQ QFMEMQRKMI IAQLIYYFGI AVSTSYLTVS
LRNHFMSNLS LNDGRIRFRS TLTYHGMLYR MCALVVISGI TGGLAYPLLK LWMIDWQAKN
TYLLGDLDDL PLINKEEQPD KGFLARISRG IMPSLPFL
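
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is illustrative only: it builds the codon table by hand (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons), and the helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Only the first three blocks and the last row of the gene sequence are used as input, so the printed GC fraction applies to that fragment, not to the whole gene.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_blocks = "ATGGCTCAAG TTATTAATGA AATGGATGTT"
last_row = "AAAGGCTTCT TAGCCAGGAT TTCACGGGGA ATTATGCCTT CTTTACCATT TCTGTAA"

print(translate(first_blocks))  # MAQVINEMDV
print(translate(last_row))      # KGFLARISRGIMPSLPFL
print(gc_fraction(first_blocks))
```

The two translated fragments match the start and end of the protein sequence listed above.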