Gene EcDH1_3955 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3955 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4262187 
End bp4263515 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content33% 
IMG OID 
Productconserved hypothetical protein 
Protein accessionACX41555 
Protein GI260451133 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA TTGAATGCGC TTGCAATTTT CTGATGGATA AAGATGCGCA GGGGTATATC 
GACCTGTCTG ATTTGGATTT AACAAGTTGT CATTTTAAAG GTGACGTTAT ATCGAAGGTG
TCTTTTTTAT CATCAAATCT ACAACATGTA ACATTCGAAT GTAAAGAAAT TGGGGATTGC
AATTTTACTA CTGCAATAGT TGATAATGTC ATATTTAGAT GTCGACGTTT ACACAATGTG
ATTTTTATCA AAGCGAGTGG TGAATGTGTC GATTTCAGCA AAAATATTCT TGATACAGTT
GACTTCTCGC AGAGTCAACT TGGTCATAGT AATTTTCGCG AATGTCAGAT TAGAAATTCA
AACTTCGATA ATTGTTATCT TTACGCTTCG CACTTCACCA GAGCAGAGTT TCTGTCTGCC
AAAGAAATAT CATTTATTAA ATCGAATTTG ACAGCTGTTA TGTTTGATTA TGTGCGAATG
TCGACAGGGA ATTTTAAAGA TTGCATTACA GAACAATTGG AATTAACTAT TGATTATTCA
GATATATTTT GGAATGAAGA TCTCGATGGT TATATCAATA ACATTATAAA AATGATTGAT
ACATTGCCAG ATAATGCAAT GATATTGAAA TCCGTTCTGG CCGTAAAACT GGTGATGCAA
TTAAAAATAC TTAATATTGT TAATAAAAAC TTTATTGAGA ATATGAAGAA AATATTTAGC
CATTGTCCTT ATATAAAAGA TCCCATTATA CGCAGTTATA TCCATTCTGA TGAAGATAAC
AAGTTCGATG ATTTTATGCG TCAACATCGA TTCAGTGAGG TGAATTTCGA TACCCAACAG
ATGATCGATT TTATTAACAG ATTTAATACG AATAAATGGC TAATTGATAA AAATAACAAT
TTTTTTATCC AACTTATCGA TCAGGCCTTA CGATCAACGG ATGATATGAT CAAAGCAAAT
GTTTGGCATC TTTATAAAGA GTGGATTCGT AGTGATGATG TTTCACCTAT ATTTATAGAA
ACTGAAGATA ATTTAAGAAC CTTTAACACG AATGAATTAA CACGAAACGA TAATATCTTT
ATCCTGTTCT CCTCAGTCGA TGATGGGCCA GTTATGGTGG TAAGCTCCCA GCGCTTACAT
GATATGTTGA ATCCTACAAA AGATACCAAT TGGAATTCCA CGTATATCTA CAAATCCAGA
CATGAGATGT TGCCTGTTAA TCTTACTCAG GAAACACTTT TCAGCTCCAA ATCTCATGGT
AAATATGCGC TTTTCCCCAT TTTTACTGCG AGTTGGCGAG CTCATCGTAT AATGAATAAG
GGTGTTTAA
 
Protein sequence
MKKIECACNF LMDKDAQGYI DLSDLDLTSC HFKGDVISKV SFLSSNLQHV TFECKEIGDC 
NFTTAIVDNV IFRCRRLHNV IFIKASGECV DFSKNILDTV DFSQSQLGHS NFRECQIRNS
NFDNCYLYAS HFTRAEFLSA KEISFIKSNL TAVMFDYVRM STGNFKDCIT EQLELTIDYS
DIFWNEDLDG YINNIIKMID TLPDNAMILK SVLAVKLVMQ LKILNIVNKN FIENMKKIFS
HCPYIKDPII RSYIHSDEDN KFDDFMRQHR FSEVNFDTQQ MIDFINRFNT NKWLIDKNNN
FFIQLIDQAL RSTDDMIKAN VWHLYKEWIR SDDVSPIFIE TEDNLRTFNT NELTRNDNIF
ILFSSVDDGP VMVVSSQRLH DMLNPTKDTN WNSTYIYKSR HEMLPVNLTQ ETLFSSKSHG
KYALFPIFTA SWRAHRIMNK GV