Gene EcDH1_3926 details


Gene Information

Locus tag: EcDH1_3926
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 4231395
End bp: 4232687
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 33%
IMG OID:
Product: pentapeptide repeat protein
Protein accession: ACX41526
Protein GI: 260451104
COG category:
COG ID:
TIGRFAM ID:
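
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(4231395, 4232687)
print(length)                     # 1293
print(protein_length_aa(length))  # 430
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of this record.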


Plasmid Coverage information

Num covering plasmid clones: 45
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCTATA ATGGTTTAAA TAATATGTTT TTCCCTCTTT GCCTGATTAA CGATAACCAC 
TCTGTCACAA GTCCATCACA TACAAAGAAA ACAAAATCAG ATAATTACAG CAAACATCAT
AAAAACACGT TAATTGACAA TAAAGCCCTC TCTCTTTTCA AAATGGATGA TCATGAAAAA
GTGATAGGCT TGATTCAGAA AATGAAAAGA ATTTATGATA GTTTACCATC AGGAAAAATC
ACGAAAGAAA CGGACAGGAA AATACATAAA TATTTTATAG ATATAGCTTC ACATGCAAAT
AATAAATGTG ACGATAGAAT TACGAGAAGA GTTTACCTTA ATAAAGATAA GGAAGTGTCA
ATTAAGGTGG TATATTTTAT AAATAATGTC ACCGTCCATA ATAATACTAT CGAAATCCCA
CAGACAGTAA ATGGTGGTTA CGATTTTTCA CACCTTAGCC TGAAAGGTAT CGTGATTAAA
GATGAAGATT TATCCAATTC GAATTTTGCA GGTTGCAGAC TACAAAACGC TATTTTTCAA
GACTGTAATA TGTATAAAAC GAATTTTAAT TTCGCCATAA TGGAAAAAAT ACTTTTTGAT
AATTGTATTC TCGATGACTC AAATTTCGCT CAGATAAAAA TGACTGACGG AACTCTAAAT
TCATGTTCCG CTATGCATGT TCAATTCTAC AATGCAACAA TGAATAGAGC CAATATTAAA
AATACCTTCC TTGATTATTC AAATTTTTAT ATGGCATACA TGGCTGAGGT AAATCTTTAT
AAAGTAATAG CGCCATATAT TAATTTATTT AGAGCCGACC TTAGCTTCTC TAAACTTGAT
TTAATTAACT TTGAACATGC TGATCTGTCT CGTGTCAACC TGAATAAAGC AACCCTCCAG
AATATAAACT TAATTGATAG CAAACTCTTT TTTACGCGGT TAACAAATAC GTTCCTCGAA
ATGGTTATAT GTACCGACTC TAATATGGCT AATGTTAATT TTAATAATGC CAATTTAAGC
AATTGCCATT TCAACTGTTC TGTTTTAACA AAAGCCTGGA TGTTTAATAT CCGTCTCTAT
CGTGTTAATT TCGATGAGGC TAGCGTCCAG GGAATGGGTA TTACCATTCT CCGTGGTGAG
GAAAATATCT CCATTAATAG TGATATCCTG GTAACACTAC AGAAATTCTT TGAAGAAGAT
TGTGCCACTC ATACTGGCAT GTCACAAACT GAGGATAATC TTCATGCAGT CGCTATGAAG
ATTACTGCAG ATATTATGCA AGATGCAGAT TGA
 
Protein sequence
MRYNGLNNMF FPLCLINDNH SVTSPSHTKK TKSDNYSKHH KNTLIDNKAL SLFKMDDHEK 
VIGLIQKMKR IYDSLPSGKI TKETDRKIHK YFIDIASHAN NKCDDRITRR VYLNKDKEVS
IKVVYFINNV TVHNNTIEIP QTVNGGYDFS HLSLKGIVIK DEDLSNSNFA GCRLQNAIFQ
DCNMYKTNFN FAIMEKILFD NCILDDSNFA QIKMTDGTLN SCSAMHVQFY NATMNRANIK
NTFLDYSNFY MAYMAEVNLY KVIAPYINLF RADLSFSKLD LINFEHADLS RVNLNKATLQ
NINLIDSKLF FTRLTNTFLE MVICTDSNMA NVNFNNANLS NCHFNCSVLT KAWMFNIRLY
RVNFDEASVQ GMGITILRGE ENISINSDIL VTLQKFFEED CATHTGMSQT EDNLHAVAMK
ITADIMQDAD
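
The gene and protein sequences above can be related by translating the nucleotide sequence. The following is a minimal sketch, assuming the 10-base spacing is purely for display and that the amino acid assignments of translation table 11 match the standard genetic code (table 11 differs in its permitted start codons, which this sketch does not handle). The helper names are hypothetical.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove display whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last rows of the gene sequence listed above.
print(translate("ATGCGCTATA ATGGTTTAAA TAATATGTTT TTCCCTCTTT GCCTGATTAA CGATAACCAC"))
# MRYNGLNNMFFPLCLINDNH
print(translate("ATTACTGCAG ATATTATGCA AGATGCAGAT TGA"))
# ITADIMQDAD
```

Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field of this record.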