Gene EcDH1_2187 details


Gene Information

Locus tag: EcDH1_2187
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 2345526
End bp: 2346662
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 42%
IMG OID:
Product: transposase IS4 family protein
Protein accession: ACX39838
Protein GI: 260449416
COG category:
COG ID:
TIGRFAM ID:
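
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the CDS includes its stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(2345526, 2346662)
print(length)                  # 1137
print(protein_length(length))  # 378
```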


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.00361348
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAACTTA AAAAATTGAT GGGACATATT TCTATTATCC CCGATTACAG ACAAGCCTGG 
AAAATGGAAC ATAAGTTATC GGATATTCTA CTGTTGACTA TTTGTGCCGT TATTTCTGGT
GCAGAAGGCT GGGAAGATAT AGAGGATTTT GGGGAAACAC ATCCCGATTT TTTGAAGCAA
TATGGTGATT TTGAAAATGG TATTCCTGTT CACGACACCA TTGCCAGAGT TGTATCCTGT
ATCAGTCCTG CAAAATTTCA CGAGTGCTTT ATTAACTGGA TGCGTGACTG CCATTCTTCA
GATGATAAAG ACGTCATTGC AATTGATGGA AAAACGCTCC GGCATTCTTA TGATAAGAGT
CGCCGCAGGG GAGCGATTCA TGTCATTAGT GCGTTCTCAA CAATGCACAG TCTGGTCATC
GGACAGATCA AGACGGATGA GAAATCTAAT GAGATTACAG CTATCCCAGA ACTTCTTAAC
ATGCTGGATA TTAAAGGAAA AATCATCACA ACTGATGCGA TGGGTTGCCA GAAAGATATT
GCAGAGAAGA TACAAAAACA GGGAGGTGAT TATTTATTCG CGGTAAAAGG AAACCAGGGG
CGGCTAAATA AAGCCTTTGA GGAAAAATTT CCGCTGAAAG AATTAAATAA TCCAGCGCAT
GACAGTTACG CAATGAGTGA AAAGAGTCAC GGCAGAGAAG AAATCCGTCT TCATATTGTT
TGCGATGTCC CTGATGAACT TATTGATTTC ACGTTTGAAT GGAAAGGGCT GAAGAAATTA
TGCGTGGCAG TCTCCTTTCG GTCCATAATA GCAGAACAAA AGAAAGAGCT CGAAATGACG
GTCAGATATT ATATCAGTTC TGCTGATTTA ACCGCAGAGA AGTTCGCCAC AGCAATCCGA
AACCACTGGC ATGTGGAGAA TAAGCTGCAC TGGCGTCTGG ACGTGGTAAT GAATGAAGAC
GACTGCAAAA TAAGAAGAGG AAATGCAGCA GAATTATTTT CAGGGATACG GCACATTGCT
ATTAATATTT TGACGAATGA TAAGGTATTC AAGGCAGGGT TAAGACGTAA GATGCGAAAA
GCAGCCATGG ACAGAAACTA CCTGGCGTCA GTCCTTACGG GGAGCGGGCT TTCGTAA
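
The gene sequence is laid out in space-separated blocks of ten bases, so it has to be joined before any computation. The following is a minimal sketch of one way to do that and to compute a GC fraction for comparison with the GC content field. The helper names are hypothetical, and it assumes GC content means the fraction of G and C bases over the full sequence length.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence as displayed (six blocks of ten bases).
first_line = "ATGGAACTTA AAAAATTGAT GGGACATATT TCTATTATCC CCGATTACAG ACAAGCCTGG "
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(gc_fraction(seq))
```

Passing the whole gene sequence block through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the reported percentage.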
 
Protein sequence
MELKKLMGHI SIIPDYRQAW KMEHKLSDIL LLTICAVISG AEGWEDIEDF GETHPDFLKQ 
YGDFENGIPV HDTIARVVSC ISPAKFHECF INWMRDCHSS DDKDVIAIDG KTLRHSYDKS
RRRGAIHVIS AFSTMHSLVI GQIKTDEKSN EITAIPELLN MLDIKGKIIT TDAMGCQKDI
AEKIQKQGGD YLFAVKGNQG RLNKAFEEKF PLKELNNPAH DSYAMSEKSH GREEIRLHIV
CDVPDELIDF TFEWKGLKKL CVAVSFRSII AEQKKELEMT VRYYISSADL TAEKFATAIR
NHWHVENKLH WRLDVVMNED DCKIRRGNAA ELFSGIRHIA INILTNDKVF KAGLRRKMRK
AAMDRNYLAS VLTGSGLS
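
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact amino-acid string NCBI uses for the standard code (base order T, C, A, G). Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts; this sketch does not special-case the first codon, which does not matter here because the gene begins with ATG. The function name is illustrative.

```python
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each of the 64 codons to its amino acid ("*" marks a stop codon).
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(raw: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop block spacing and newlines
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT input
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate_cds("ATGGAACTTA AAAAATTG"))  # MELKKL
print(translate_cds("TCGTAA"))               # S
```

The two calls use the first six codons and the last two codons of the gene sequence, which match the start (MELKKL) and the final residue (S) of the protein sequence shown above.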