Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | DET1003 |
Symbol | |
ID | 3229730 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dehalococcoides ethenogenes 195 |
Kingdom | Bacteria |
Replicon accession | NC_002936 |
Strand | - |
Start bp | 915785 |
End bp | 916762 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637120567 |
Product | hypothetical protein |
Protein accession | YP_181723 |
Protein GI | 57234263 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.305983 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTATATG TTTTTAAGGA ACTCAGCCTT GAGGAATTAA GCTCTGATTT TGCCTCCGGC CAAAACGGGC TTAATTTTGT TTATCCCTTT TTGCACCCCG GCTGGCAGTC TGTCTGGCAG AAGTGTTTTT CTCCGCCCGG CAGGCAAGTT TGCGGGCTGG TGACTTTGGG GTCTGAGACG GTGGGGCTTA TAGCCTTGCG GATAGACAGC GGGGTTGCCC GTTTTATAGC TGACGGAAAT GTTTTTGATT ATCTGGATTT TGCGGTAAAA CCCGGCCATC AGGCCGGATA TTTTGAGCTG CTGCTTAAGC TTTTGAAGCA AAACCAGGTT GAGGTGCTGG AACTGGAGGG GCTTACAGCC CGTTCCCAGG CTTATGACGT TCTGCTGCCG CTGGCGAAAG AAAAAGGTAT TTTGGTAGTT TGCGAACAGG CAGACGTTTC TCCGCTGCTT GAGCTGCCGG CTGACTTTGA AAAATACCTG GCAGGGCTGG AAAAGCACCA GCGCCACGAA CTTAAACGCA AATTACGCCG TCTTGAGGAA ACTCTGACAC CCCGGCTGGA AGTAGTGACC AGCCCTGGTG ATATAGATAT TTTGCTTGAC CAGATGGAGC TCAGCCACCT TGAGAAAGCA ATTTTCCTTA ATCCGGAAAT GCGCTGCTAT TTTAAGCAGC TGGCAGACTG GCTTGGCAGT CAGGGCTATC TGCGTCTTGT TTTTCTCAAA ACCGGGGAGA CTGTTTTGGC AAGTCTGTTT TGCTTTGATT ATAATAATAT AAGATACTTG TATAACAGCG GCTACAATCC GGAGTATTCC CACCTTAGTG TGGGCGTGCT TTCCAAAATG CTGGCTATTA AGGATTCTAT TGAAAAAGGC TATGATGCCT TTGATTTTCT GCGCGGAGAG GAAAAATATA AATTTCATCT GGGCGGGAAA TCCCAGCCGG TTTACCGCTG CTGCATAACT CTGGATAGTC AGGTATAA
|
Protein sequence | MVYVFKELSL EELSSDFASG QNGLNFVYPF LHPGWQSVWQ KCFSPPGRQV CGLVTLGSET VGLIALRIDS GVARFIADGN VFDYLDFAVK PGHQAGYFEL LLKLLKQNQV EVLELEGLTA RSQAYDVLLP LAKEKGILVV CEQADVSPLL ELPADFEKYL AGLEKHQRHE LKRKLRRLEE TLTPRLEVVT SPGDIDILLD QMELSHLEKA IFLNPEMRCY FKQLADWLGS QGYLRLVFLK TGETVLASLF CFDYNNIRYL YNSGYNPEYS HLSVGVLSKM LAIKDSIEKG YDAFDFLRGE EKYKFHLGGK SQPVYRCCIT LDSQV
|
| |