Gene Information |
Locus tag | EcDH1_3926 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 4231395 |
End bp | 4232687 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | |
Product | pentapeptide repeat protein |
Protein accession | ACX41526 |
Protein GI | 260451104 |
COG category | |
COG ID | |
TIGRFAM ID | |
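The coordinate and length fields above agree with one another, and this can be checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4231395, 4232687)
print(length_bp)                  # 1293
print(protein_length(length_bp))  # 430
```

Both values match the Gene Length and Protein Length rows of this record.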
| |
Plasmid Coverage information |
Num covering plasmid clones | 45 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCTATA ATGGTTTAAA TAATATGTTT TTCCCTCTTT GCCTGATTAA CGATAACCAC TCTGTCACAA GTCCATCACA TACAAAGAAA ACAAAATCAG ATAATTACAG CAAACATCAT AAAAACACGT TAATTGACAA TAAAGCCCTC TCTCTTTTCA AAATGGATGA TCATGAAAAA GTGATAGGCT TGATTCAGAA AATGAAAAGA ATTTATGATA GTTTACCATC AGGAAAAATC ACGAAAGAAA CGGACAGGAA AATACATAAA TATTTTATAG ATATAGCTTC ACATGCAAAT AATAAATGTG ACGATAGAAT TACGAGAAGA GTTTACCTTA ATAAAGATAA GGAAGTGTCA ATTAAGGTGG TATATTTTAT AAATAATGTC ACCGTCCATA ATAATACTAT CGAAATCCCA CAGACAGTAA ATGGTGGTTA CGATTTTTCA CACCTTAGCC TGAAAGGTAT CGTGATTAAA GATGAAGATT TATCCAATTC GAATTTTGCA GGTTGCAGAC TACAAAACGC TATTTTTCAA GACTGTAATA TGTATAAAAC GAATTTTAAT TTCGCCATAA TGGAAAAAAT ACTTTTTGAT AATTGTATTC TCGATGACTC AAATTTCGCT CAGATAAAAA TGACTGACGG AACTCTAAAT TCATGTTCCG CTATGCATGT TCAATTCTAC AATGCAACAA TGAATAGAGC CAATATTAAA AATACCTTCC TTGATTATTC AAATTTTTAT ATGGCATACA TGGCTGAGGT AAATCTTTAT AAAGTAATAG CGCCATATAT TAATTTATTT AGAGCCGACC TTAGCTTCTC TAAACTTGAT TTAATTAACT TTGAACATGC TGATCTGTCT CGTGTCAACC TGAATAAAGC AACCCTCCAG AATATAAACT TAATTGATAG CAAACTCTTT TTTACGCGGT TAACAAATAC GTTCCTCGAA ATGGTTATAT GTACCGACTC TAATATGGCT AATGTTAATT TTAATAATGC CAATTTAAGC AATTGCCATT TCAACTGTTC TGTTTTAACA AAAGCCTGGA TGTTTAATAT CCGTCTCTAT CGTGTTAATT TCGATGAGGC TAGCGTCCAG GGAATGGGTA TTACCATTCT CCGTGGTGAG GAAAATATCT CCATTAATAG TGATATCCTG GTAACACTAC AGAAATTCTT TGAAGAAGAT TGTGCCACTC ATACTGGCAT GTCACAAACT GAGGATAATC TTCATGCAGT CGCTATGAAG ATTACTGCAG ATATTATGCA AGATGCAGAT TGA
Protein sequence | MRYNGLNNMF FPLCLINDNH SVTSPSHTKK TKSDNYSKHH KNTLIDNKAL SLFKMDDHEK VIGLIQKMKR IYDSLPSGKI TKETDRKIHK YFIDIASHAN NKCDDRITRR VYLNKDKEVS IKVVYFINNV TVHNNTIEIP QTVNGGYDFS HLSLKGIVIK DEDLSNSNFA GCRLQNAIFQ DCNMYKTNFN FAIMEKILFD NCILDDSNFA QIKMTDGTLN SCSAMHVQFY NATMNRANIK NTFLDYSNFY MAYMAEVNLY KVIAPYINLF RADLSFSKLD LINFEHADLS RVNLNKATLQ NINLIDSKLF FTRLTNTFLE MVICTDSNMA NVNFNNANLS NCHFNCSVLT KAWMFNIRLY RVNFDEASVQ GMGITILRGE ENISINSDIL VTLQKFFEED CATHTGMSQT EDNLHAVAMK ITADIMQDAD
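The sequences in this record are displayed in space-separated blocks of ten characters, so the spacing has to be removed before downstream use. The following is a minimal sketch of that step with a rough sanity check. The helper names are hypothetical. The check assumes an ATG start codon, which holds for this record, although translation table 11 also permits alternative start codons.

```python
def join_blocks(displayed: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(displayed.split())


def looks_like_cds(dna: str) -> bool:
    """Rough check: length divisible by 3, ATG start, standard stop codon."""
    dna = dna.upper()
    return (
        len(dna) % 3 == 0
        and dna.startswith("ATG")
        and dna[-3:] in {"TAA", "TAG", "TGA"}
    )


# First two blocks of the gene sequence shown above, used only as a demo.
prefix = join_blocks("ATGCGCTATA ATGGTTTAAA")
print(len(prefix))  # 20
```

Applying `join_blocks` to the full Gene sequence field and passing the result to `looks_like_cds` gives a quick consistency check before translating.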