Gene EcDH1_3959 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3959 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4265159 
End bp4266499 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content52% 
IMG OID 
Productporin LamB type 
Protein accessionACX41559 
Protein GI260451137 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGATTA CTCTGCGCAA ACTTCCTCTG GCGGTTGCCG TCGCAGCGGG CGTAATGTCT 
GCTCAGGCAA TGGCTGTTGA TTTCCACGGC TATGCACGTT CCGGTATTGG TTGGACAGGT
AGCGGCGGTG AACAACAGTG TTTCCAGACT ACCGGTGCTC AAAGTAAATA CCGTCTTGGC
AACGAATGTG AAACTTATGC TGAATTAAAA TTGGGTCAGG AAGTGTGGAA AGAGGGCGAT
AAGAGCTTCT ATTTCGACAC TAACGTGGCC TATTCCGTCG CACAACAGAA TGACTGGGAA
GCTACCGATC CGGCCTTCCG TGAAGCAAAC GTGCAGGGTA AAAACCTGAT CGAATGGCTG
CCAGGCTCCA CCATCTGGGC AGGTAAGCGC TTCTACCAAC GTCATGACGT TCATATGATC
GACTTCTACT ACTGGGATAT TTCTGGTCCT GGTGCCGGTC TGGAAAACAT CGATGTTGGC
TTCGGTAAAC TCTCTCTGGC AGCAACCCGC TCCTCTGAAG CTGGTGGTTC TTCCTCTTTC
GCCAGCAACA ATATTTATGA CTATACCAAC GAAACCGCGA ACGACGTTTT CGATGTGCGT
TTAGCGCAGA TGGAAATCAA CCCGGGCGGC ACATTAGAAC TGGGTGTCGA CTACGGTCGT
GCCAACTTGC GTGATAACTA TCGTCTGGTT GATGGCGCAT CGAAAGACGG CTGGTTATTC
ACTGCTGAAC ATACTCAGAG TGTCCTGAAG GGCTTTAACA AGTTTGTTGT TCAGTACGCT
ACTGACTCGA TGACCTCGCA GGGTAAAGGG CTGTCGCAGG GTTCTGGCGT TGCATTTGAT
AACGAAAAAT TTGCCTACAA TATCAACAAC AACGGTCACA TGCTGCGTAT CCTCGACCAC
GGTGCGATCT CCATGGGCGA CAACTGGGAC ATGATGTACG TGGGTATGTA CCAGGATATC
AACTGGGATA ACGACAACGG CACCAAGTGG TGGACCGTCG GTATTCGCCC GATGTACAAG
TGGACGCCAA TCATGAGCAC CGTGATGGAA ATCGGCTACG ACAACGTCGA ATCCCAGCGC
ACCGGCGACA AGAACAATCA GTACAAAATT ACCCTCGCAC AACAATGGCA GGCTGGCGAC
AGCATCTGGT CACGCCCGGC TATTCGTGTC TTCGCAACCT ACGCCAAGTG GGATGAGAAA
TGGGGTTACG ACTACACCGG TAACGCTGAT AACAACGCGA ACTTCGGCAA AGCCGTTCCT
GCTGATTTCA ACGGCGGCAG CTTCGGTCGT GGCGACAGCG ACGAGTGGAC CTTCGGTGCC
CAGATGGAAA TCTGGTGGTA A
 
Protein sequence
MMITLRKLPL AVAVAAGVMS AQAMAVDFHG YARSGIGWTG SGGEQQCFQT TGAQSKYRLG 
NECETYAELK LGQEVWKEGD KSFYFDTNVA YSVAQQNDWE ATDPAFREAN VQGKNLIEWL
PGSTIWAGKR FYQRHDVHMI DFYYWDISGP GAGLENIDVG FGKLSLAATR SSEAGGSSSF
ASNNIYDYTN ETANDVFDVR LAQMEINPGG TLELGVDYGR ANLRDNYRLV DGASKDGWLF
TAEHTQSVLK GFNKFVVQYA TDSMTSQGKG LSQGSGVAFD NEKFAYNINN NGHMLRILDH
GAISMGDNWD MMYVGMYQDI NWDNDNGTKW WTVGIRPMYK WTPIMSTVME IGYDNVESQR
TGDKNNQYKI TLAQQWQAGD SIWSRPAIRV FATYAKWDEK WGYDYTGNAD NNANFGKAVP
ADFNGGSFGR GDSDEWTFGA QMEIWW