Gene EcHS_A0728 details

Gene Information

Locus tag: EcHS_A0728
Symbol:
ID: 5593046
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 738688
End bp: 740094
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 51%
IMG OID: 640919905
Product: OprD family outer membrane porin
Protein accession: YP_001457479
Protein GI: 157160161
COG category:
COG ID:
TIGRFAM ID:
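
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 738688
END_BP = 740094
GENE_LENGTH_BP = 1407
PROTEIN_LENGTH_AA = 468


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks should agree with the record if the assumptions hold.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 740094 - 738688 + 1 gives 1407 bp, and 1407 / 3 - 1 gives 468 aa, matching the Gene Length and Protein Length fields.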


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.00197385
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTACGT TTAGTGGCAA ACGTAGTACG CTGGCGCTGG CTATCGCCGG TGTTACAGCA 
ATGTCGGGCT TTATGGCAAT GCCGGAGGCT CGCGCCGAAG GATTCATCGA CGATTCAACC
TTAACCGGCG GTATATATTA CTGGCAGCGT GAACGCGACC GTAAAGATGT TACCGACGGC
GACAAATACA AAACCAACCT TTCTCACTCC ACCTGGAATG CCAACCTCGA TTTTCAGTCT
GGTTATGCTG CTGATATGTT CGGCCTTGAT ATTGCCGCGT TTACGGCGAT TGAAATGGCG
GAAAACGGCG ACAGCTCTCA CCCGAACGAA ATCGCGTTTT CAAAAAGTAA TAAAGCCTAT
GACGAAGACT GGTCCGGCGA CAAAAGCGGT ATAAGCCTGT ATAAAGCAGC GGCCAAATTT
AAATACGGTC CGGTTTGGGC GAGGGCAGGT TATATTCAGC CAACCGGTCA GACGCTGTTA
GCGCCTCACT GGAGCTTTAT GCCGGGTACT TATCAGGGTG CGGAAGCCGG AGCGAATTTT
GATTACGGCG ATGCCGGTGC GTTGAGTTTC TCCTACATGT GGACCAACGA ATACAAAGCG
CCGTGGCATC TGGAAATGGA TGAGTTTTAT CAGAACGATA AAACCACCAA AGTTGATTAT
CTGCACTCCC TTGGGGCGAA ATACGACTTC AAAAATAACT TCGTACTGGA AGCGGCTTTT
GGTCAGGCGG AAGGGTATAT CGATCAATAT TTTGCCAAAG CCAGCTACAA ATTTGATATC
GCCGGTAGCC CGTTAACCAC CAGCTACCAG TTCTACGGTA CCCGCGATAA AGTTGACGAT
CGCAGCGTCA ACGACCTTTA TGACGGCACC GCCTGGCTGC AGGCGTTGAC CTTTGGTTAC
CGGGCGGCTG ACGTAGTGGA TTTGCGCCTC GAAGGCACCT GGGTTAAGGC TGACGGTCAG
CAGGGATACT TCCTGCAACG TATGACTCCA ACCTACGCTT CCTCAAACGG TCGCCTGGAT
ATCTGGTGGG ACAACCGTTC TGACTTCAAC GCCAACGGCG AAAAAGCAGT CTTCTTCGGT
GCGATGTATG ACCTGAAAAA CTGGAATCTT CCAGGCTTCG CCATCGGCGC TTCCTACGTT
TACGCATGGG ATGCTAAACC TGCGACCTGG CAGAGCAATC CGGATGCGTA CTACGACAAA
AACCGGACTA TTGAAGAGTC TGCATACAGC CTGGATGCGG TCTATACCAT TCAGGACGGT
CGCGCCAAAG GCACGATGTT CAAACTGCAC TTCACCGAAT ACGACAACCA CTCCGACATC
CCAAGCTGGG GCGGTGGTTA CGGCAACATC TTCCAGGATG AGCGTGACGT AAAATTTATG
GTAATCGCAC CATTCACCAT CTTCTGA
 
Protein sequence
MRTFSGKRST LALAIAGVTA MSGFMAMPEA RAEGFIDDST LTGGIYYWQR ERDRKDVTDG 
DKYKTNLSHS TWNANLDFQS GYAADMFGLD IAAFTAIEMA ENGDSSHPNE IAFSKSNKAY
DEDWSGDKSG ISLYKAAAKF KYGPVWARAG YIQPTGQTLL APHWSFMPGT YQGAEAGANF
DYGDAGALSF SYMWTNEYKA PWHLEMDEFY QNDKTTKVDY LHSLGAKYDF KNNFVLEAAF
GQAEGYIDQY FAKASYKFDI AGSPLTTSYQ FYGTRDKVDD RSVNDLYDGT AWLQALTFGY
RAADVVDLRL EGTWVKADGQ QGYFLQRMTP TYASSNGRLD IWWDNRSDFN ANGEKAVFFG
AMYDLKNWNL PGFAIGASYV YAWDAKPATW QSNPDAYYDK NRTIEESAYS LDAVYTIQDG
RAKGTMFKLH FTEYDNHSDI PSWGGGYGNI FQDERDVKFM VIAPFTIF
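
The relationship between the gene sequence and the protein sequence above can be sketched as a codon-by-codon translation. The sketch below assumes the codon assignments of translation table 11, which match the standard code for elongation codons; it does not model table 11's alternative start codons, which does not matter here because the gene begins with ATG. The helper names (clean, gc_content, translate) are illustrative only.

```python
# Codon table built in TCAG order; amino-acid assignments follow the
# standard code, which translation table 11 shares for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("ATGCGTACGT TTAGTGGCAA ACGTAGTACG"))
```

Translating the first 30 bases this way yields MRTFSGKRST, the first ten residues of the protein sequence. Passing the full gene sequence to gc_content would give a value to compare against the GC content field.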