Gene EcDH1_4247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_4247 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4612249 
End bp4613865 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content45% 
IMG OID 
Productporin LamB type 
Protein accessionACX41845 
Protein GI260451423 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones49 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTAGAC GAAATCTTAT TACCTCTGCC ATCTTATTAA TGGCACCGTT AGCCTTTTCT 
GCACAATCAT TGGCTGAATC ATTAACGGTG GAACAACGCC TTGAGTTATT AGAAAAGGCG
TTAAGAGAAA CGCAAAGCGA ACTCAAAAAG TATAAAGATG AAGAGAAGAA AAAGTATACG
CCAGCGACGG TGAATCGTAG CGTAAGTACG AATGATCAAG GGTATGCCGC CAATCCGTTC
CCGACCAGTA GTGCCGCAAA ACCTGATGCT GTACTGGTCA AAAATGAAGA GAAAAATGCC
AGTGAGACAG GCTCGATTTA TTCTTCCATG ACTCTGAAAG ATTTCAGTAA ATTTGTGAAA
GATGAAATTG GCTTTAGTTA CAACGGCTAT TACCGTTCTG GTTGGGGGAC CGCCTCTCAT
GGTTCACCTA AATCATGGGC GATTGGTTCT CTGGGCCGCT TTGGTAACGA ATACTCCGGC
TGGTTTGATT TGCAGTTAAA ACAACGTGTC TACAACGAAA ACGGCAAACG GGTTGATGCC
GTTGTGATGA TGGATGGTAA CGTTGGTCAG CAGTACTCTA CCGGCTGGTT TGGCGATAAC
GCCGGTGGCG AGAACTATAT GCAGTTCTCC GATATGTACG TTACCACCAA AGGTTTCCTG
CCCTTTGCGC CAGAGGCTGA TTTCTGGGTG GGTAAACACG GTGCGCCGAA AATTGAAATC
CAGATGCTTG ACTGGAAAAC GCAGCGTACT GATGCCGCAG CGGGTGTAGG TCTGGAAAAC
TGGAAAGTCG GTCCGGGTAA AATTGATATC GCGCTGGTTC GCGAAGATGT CGATGATTAC
GATCGCAGCC TGCAAAACAA ACAGCAGATT AATACCAATA CCATTGATTT ACGCTATAAA
GATATCCCGT TATGGGATAA AGCCACCTTA ATGGTAAGTG GTCGTTATGT CACGGCAAAC
GAAAGCGCAT CGGAAAAAGA TAATCAGGAT AATAACGGGT ATTATGACTG GAAAGATACC
TGGATGTTTG GCACATCTTT AACGCAGAAA TTTGATAAAG GTGGCTTCAA CGAATTCTCC
TTCCTGGTCG CGAATAACTC TATCGCCAGT AACTTTGGCC GTTATGCTGG CGCAAGTCCA
TTTACCACCT TTAATGGTCG TTATTATGGT GATCACACCG GCGGAACAGC GGTACGTCTG
ACTTCGCAGG GCGAAGCCTA TATTGGCGAT CATTTCATTG TAGCTAACGC GATTGTTTAC
TCCTTCGGTA ACGATATATA TAGCTACGAA ACAGGCGCCC ACTCTGATTT CGAATCTATT
CGTGCGGTTG TTCGCCCGGC CTATATTTGG GACCAATATA ACCAGACAGG TGTTGAACTG
GGCTATTTCA CCCAGCAAAA CAAAGATGCG AATAGTAATA AATTTAATGA GTCTGGTTAT
AAAACCACGC TCTTCCATAC CTTTAAAGTC AATACCAGTA TGTTGACCTC GCGTCCGGAA
ATTCGTTTCT ACGCCACGTA TATCAAAGCC CTGGAAAACG AACTGGATGG CTTCACCTTT
GAAGACAATA AAGACGACCA GTTTGCTGTC GGTGCCCAGG CTGAAATCTG GTGGTAA
 
Protein sequence
MFRRNLITSA ILLMAPLAFS AQSLAESLTV EQRLELLEKA LRETQSELKK YKDEEKKKYT 
PATVNRSVST NDQGYAANPF PTSSAAKPDA VLVKNEEKNA SETGSIYSSM TLKDFSKFVK
DEIGFSYNGY YRSGWGTASH GSPKSWAIGS LGRFGNEYSG WFDLQLKQRV YNENGKRVDA
VVMMDGNVGQ QYSTGWFGDN AGGENYMQFS DMYVTTKGFL PFAPEADFWV GKHGAPKIEI
QMLDWKTQRT DAAAGVGLEN WKVGPGKIDI ALVREDVDDY DRSLQNKQQI NTNTIDLRYK
DIPLWDKATL MVSGRYVTAN ESASEKDNQD NNGYYDWKDT WMFGTSLTQK FDKGGFNEFS
FLVANNSIAS NFGRYAGASP FTTFNGRYYG DHTGGTAVRL TSQGEAYIGD HFIVANAIVY
SFGNDIYSYE TGAHSDFESI RAVVRPAYIW DQYNQTGVEL GYFTQQNKDA NSNKFNESGY
KTTLFHTFKV NTSMLTSRPE IRFYATYIKA LENELDGFTF EDNKDDQFAV GAQAEIWW