Gene EcDH1_2956 details

Gene Information

Locus tag: EcDH1_2956
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 3171379
End bp: 3172785
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 51%
IMG OID:
Product: outer membrane porin
Protein accession: ACX40585
Protein GI: 260450163
COG category:
COG ID:
TIGRFAM ID:
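
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminating stop codon


# Values copied from the record above.
length = gene_length(3171379, 3172785)
print(length)                  # 1407
print(protein_length(length))  # 468
```

Under these assumptions, the 1407 bp span corresponds to 469 codons, of which 468 encode residues.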


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.00243761
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTACGT TTAGTGGCAA ACGTAGTACG CTGGCGCTGG CTATCGCCGG TGTTACAGCA 
ATGTCGGGCT TTATGGCAAT GCCGGAGGCT CGCGCCGAAG GATTCATCGA CGATTCAACC
TTAACCGGCG GTATCTATTA CTGGCAGCGT GAACGCGACC GTAAAGATGT TACCGACGGC
GACAAATACA AAACCAACCT TTCTCACTCC ACCTGGAATG CCAACCTCGA TTTTCAGTCC
GGTTATGCTG CTGATATGTT CGGCCTTGAT ATTGCTGCGT TTACGGCGAT TGAAATGGCG
GAAAACGGCG ACAGCTCCCA CCCGAACGAA ATCGCGTTTT CAAAAAGTAA TAAAGCCTAT
GACGAAGACT GGTCCGGCGA CAAAAGCGGT ATAAGCCTGT ATAAAGCTGC GGCCAAATTT
AAATACGGTC CGGTTTGGGC GAGGGCAGGT TACATTCAGC CAACTGGTCA AACGCTGTTA
GCGCCGCACT GGAGCTTTAT GCCAGGTACT TATCAGGGGG CGGAAGCCGG GGCGAATTTT
GATTACGGCG ATGCTGGTGC GTTGAGTTTC TCCTACATGT GGACCAACGA ATACAAAGCG
CCGTGGCATC TGGAAATGGA TGAGTTTTAT CAGAACGATA AAACCACCAA AGTTGATTAT
CTGCACTCCT TTGGGGCGAA ATACGACTTC AAAAATAACT TCGTACTGGA AGCGGCATTT
GGTCAGGCGG AAGGGTATAT CGATCAATAT TTTGCCAAAG CCAGCTACAA ATTTGATATC
GCCGGTAGCC CGTTAACCAC CAGCTACCAG TTCTACGGTA CCCGAGATAA AGTTGACGAT
CGCAGCGTCA ACGACCTTTA TGACGGCACC GCCTGGCTGC AAGCGTTGAC CTTTGGTTAC
CGGGCGGCTG ACGTAGTGGA TTTGCGCCTC GAAGGCACCT GGGTTAAGGC TGACGGTCAG
CAGGGATACT TCCTGCAACG TATGACTCCA ACCTACGCTT CCTCAAACGG TCGCCTGGAT
ATCTGGTGGG ATAACCGTTC TGACTTCAAC GCCAACGGCG AAAAAGCGGT CTTCTTCGGT
GCGATGTATG ACCTGAAAAA CTGGAATCTT CCAGGCTTCG CCATCGGCGC TTCCTACGTT
TACGCATGGG ATGCTAAACC TGCGACCTGG CAGAGCAATC CGGATGCGTA CTACGACAAA
AACCGGACTA TTGAAGAGTC TGCCTACAGC CTGGATGCGG TCTATACCAT TCAGGACGGT
CGCGCCAAAG GCACGATGTT CAAACTGCAT TTCACCGAAT ACGACAACCA CTCCGACATC
CCAAGCTGGG GCGGTGGTTA CGGCAACATC TTCCAGGATG AGCGTGACGT GAAATTTATG
GTAATCGCAC CATTCACCAT CTTCTGA
 
Protein sequence
MRTFSGKRST LALAIAGVTA MSGFMAMPEA RAEGFIDDST LTGGIYYWQR ERDRKDVTDG 
DKYKTNLSHS TWNANLDFQS GYAADMFGLD IAAFTAIEMA ENGDSSHPNE IAFSKSNKAY
DEDWSGDKSG ISLYKAAAKF KYGPVWARAG YIQPTGQTLL APHWSFMPGT YQGAEAGANF
DYGDAGALSF SYMWTNEYKA PWHLEMDEFY QNDKTTKVDY LHSFGAKYDF KNNFVLEAAF
GQAEGYIDQY FAKASYKFDI AGSPLTTSYQ FYGTRDKVDD RSVNDLYDGT AWLQALTFGY
RAADVVDLRL EGTWVKADGQ QGYFLQRMTP TYASSNGRLD IWWDNRSDFN ANGEKAVFFG
AMYDLKNWNL PGFAIGASYV YAWDAKPATW QSNPDAYYDK NRTIEESAYS LDAVYTIQDG
RAKGTMFKLH FTEYDNHSDI PSWGGGYGNI FQDERDVKFM VIAPFTIF
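
The gene and protein sequences above are related by translation table 11, and the GC content is derived from the gene sequence. The following is a minimal sketch of how both could be recomputed from the listing. It assumes the sequence is pasted as text with the space-separated 10-base blocks left in place. The helper names are hypothetical. The codon map is the NCBI table 11 amino-acid assignment, which for sense and stop codons is identical to the standard code (the tables differ only in permitted start codons).

```python
from itertools import product

# NCBI codon ordering: first, second, and third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence above.
print(translate("ATGCGTACGT TTAGTGGCAA ACGTAGTACG"))  # MRTFSGKRST
```

Applied to the full gene sequence, `translate` can be compared against the protein listing (after removing its spaces), and `gc_percent` against the reported GC content.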