Gene EcDH1_1309 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_1309 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp1409270 
End bp1410427 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content48% 
IMG OID 
Productintegrase family protein 
Protein accessionACX38982 
Protein GI260448560 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.00894189 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCACCG TTAAGCAGAT TGAAGCAGCA AAGCCGAAAG AAAAACCATA CCGCCTTCTC 
GATGGTAATG GCCTGTACCT TTATGTCCCT GTGTCAGGGA AAAAGGTATG GCAGCTTCGC
TACAAGATTG ACGGTAAGGA GAAAATCCTG ACCGTCGGAA AATATCCGCT TATGACTTTG
CAGGAGGCAA GGGATAAAGC ATGGACTGCG AGGAAAGACA TCTCGGTTGG CATCGATCCT
GTAAAGGCGA AAAAGGCTTC GTCTAACAAC AATTCCTTTA GTGCGATTTA CAAGGAATGG
TACGAGCACA AGAAGCAAGT ATGGTCAGTA GGGTATGCAA CTGAACTTGC CAAAATGTTT
GACGACGACA TTTTACCTAT CATTGGCGGC CTTGAAATTC AGGATATTGA GCCGATGCAA
CTGCTGGAAG TAATCCGCAG GTTTGAAGAT CGCGGTGCAA TGGAACGAGC CAACAAAGCA
CGCAGAAGAT GCGGCGAGGT TTTCCGTTAC GCTATTGTCA CCGGAAGGGC TAAATATAAC
CCGGCACCTG ACCTTGCTGA CGCCATGAAG GGATACCGCA AGAAGAACTT CCCGTTTCTT
CCTGCAGACC AGATCCCGGC ATTCAACAAA GCACTGGCAA CATTTTCAGG AAGTATCGTA
TCGCTCATTG CGACCAAAGT TTTACGCTAC ACAGCCCTAA GAACGAAAGA GCTTCGTTCC
ATGCTATGGA AGAACGTCGA TTTTGAAAAT AGGATTATCA CCATCGACGC CAGTGTGATG
AAAGGACGCA AAATTCATGT GGTTCCTATG TCAGACCAGG TAGTTGAACT TCTCACTACG
CTAAGCTCCA TCACCAAACC AGTCTCAGAG TTTGTTTTTG CCGGGCGCAA CGATAAGAAG
AAGCCAATCT GCGAGAACGC GGTACTGCTT GTGATCAAAC AAATCGGCTA TGAGGGTCTG
GAAAGCGGTC ACGGATTCAG GCATGAATTC AGCACGATTA TGAACGAGCA CGAATGGCCT
GCTGACGCTA TTGAAGTGCA ACTGGCACAT GCAAACGGCG GATCTGTGCG TGGGATTTAC
AACCATGCTC AGTATCTCGA TAAACGCAGA GAAATGATGC AATGGTGGGC GGACTGGCTT
GATGAGAAGG TGGAGTGA
 
Protein sequence
MLTVKQIEAA KPKEKPYRLL DGNGLYLYVP VSGKKVWQLR YKIDGKEKIL TVGKYPLMTL 
QEARDKAWTA RKDISVGIDP VKAKKASSNN NSFSAIYKEW YEHKKQVWSV GYATELAKMF
DDDILPIIGG LEIQDIEPMQ LLEVIRRFED RGAMERANKA RRRCGEVFRY AIVTGRAKYN
PAPDLADAMK GYRKKNFPFL PADQIPAFNK ALATFSGSIV SLIATKVLRY TALRTKELRS
MLWKNVDFEN RIITIDASVM KGRKIHVVPM SDQVVELLTT LSSITKPVSE FVFAGRNDKK
KPICENAVLL VIKQIGYEGL ESGHGFRHEF STIMNEHEWP ADAIEVQLAH ANGGSVRGIY
NHAQYLDKRR EMMQWWADWL DEKVE