Gene Information |
Locus tag | DET1078 |
Symbol | |
ID | 3229641 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dehalococcoides ethenogenes 195 |
Kingdom | Bacteria |
Replicon accession | NC_002936 |
Strand | - |
Start bp | 981225 |
End bp | 983948 |
Gene Length | 2724 bp |
Protein Length | 907 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637120642 |
Product | TP901 family tail tape measure protein |
Protein accession | YP_181793 |
Protein GI | 57234174 |
COG category | [S] Function unknown |
COG ID | [COG5280] Phage-related minor tail protein |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
|
|
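The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information table for DET1078.
start_bp = 981225
end_bp = 983948

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bases; the final codon is assumed to be the stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 2724
print(protein_length)  # 907
```

Under these assumptions the computed values agree with the listed 2724 bp and 907 aa.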
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGACA GGATAAAAGG CATAACCGTG GAAATCGGCG GCGATACGAC CGGCCTTTCC AAAGCGCTCT CCGACGTAAA CAAGGAAATC AAAAACACGC AGTCGCAGCT TAAAGACGTC AATAAGCTCC TAAAGCTCGA CCCGACGAAT ACCACGCTGC TTGAGCAGAA ACAGAAGCTC CTTAAACAGG CTGTCTCCGA AACGAAAGAT AAGCTCACAC AGCTGAAGTC CGTGCAAGAC CAGATGGATG CTGGACTCAA AAACGGTACC GTCACCCAGC AGCAATACGA TGCATGGCAG CGTGAGATCA TAGAGACAGA AAACGAGCTT AGAAACCTCG AACAGCAGTG CAGAGAAACA GACTCCCATA TCTCAGCTAC CTTAAAGCAG ACCGGAAGCA AGCTGCAGGA AGTCGGCGGC AAGATATCCA GTGTGGGCAC AGGACTGACC ACGCATGTCA CGGCTCCGAT TATGGCCATC GGCGCAGCTT CCCTTGCTGC CTTTAATGAA GTGGATGCAG GGCTTGATAT CGTGGCGCAG AAAACCGGCG CTACGGGAAA AGCTCTGGAA GACATGAACC AGATCGTCAA AGACCTCGCC ACAGAGATAC CGACGGACTT CGAAACTGCC GGTGCCGCTG TCGGCGAGGT CAACACCCGC TTCGGCTTAA CCGGGCAGGC GCTTGATGAT CTCTCTGCAA AATTCATAAA GTTTGCCCAG CTCAACGACA CCGATGTTTC GACATCTATC GACAACGTGT CCTCGGTTAT GAACGCCTTC GGTATGGACG CATCTGAAGC AGACTCCCTT CTTGATGCAT TAAACGCCAC CGGTCAGGCG ACCGGCATTG ATATGGATAC CCTCGCGGGC GCTCTTTCCT CCAATGCCAT CCAGCTAAAG GAAATGGGAC TGACCGCCCA GCAGGCTGCC GGTTTCATGG GCATGGTGGA AATGTCCGGT CTTGATACCT CATCTGCCAT GATGGGTCTT AAGACCGCCA TGAAGAATGC GACGAAGGAC GGCAAAACGC TGGATCAGGC GCTTGCTGAT TTCTCCCAGA CCATGAAAGG CAACGGCTCT GAAACGGAAA AGCTGCAGGC GGCCTATGAC CTTTTCGGAA GCAAGGCAGG TGCGTCGATT TACAATGCCG TCCAGACCGG GAAACTGAGT CTTGATGACC TTGCCGGTTC CCTCGGTGAC TTTGAGGGAA GTGTCGAGAA CACCTTCAAC GAGACCCTCG ACCCGATTGA CCAGTTCAAG ATGACGATGA ACTCCCTGAA GGAAACCGGC GCGGAAATCG GAAACACCCT CGCTACCGTT CTGGCTCCGG TCTTAAAGGA CATCTCCGCA GCCCTGAAGG CATTTGCTGA AATGTGGAGC AAGATTCCGG CTCCGGTTCA GCAGACGATT GTAAAGATCG CCCTTGTGGC TGCGGCTATC GGCCCGATTC TGGTGGTGGT CGGAAAGATC ATCTCGGCGG TCGGCACGAT TATGACGATC ATACCGCAGG TTTCTGCTGC AATCGGTGTG GTAAAAGGCG CGATGGCAGC CCTGAACGCG ACCATGCTGG CCAATCCTAT CGTCCTAATC ATCGCTGCGA TTGCGGCGCT GGTGGCTGCC TTCATCTATC TTTGGAACAC GAACGAGGGC TTTAGGCAGT TCTGGATCGA CCTGTGGGAA AACATCAAGC AGGCAGTCGT TACGGCATGG GAGGCGATCA AGAGCTTCTT CTCCACTGTC TGGGAGACCA TCAAGGGAAT CTTCGAAGCT GCGGTGAACG GCATCAGCAC CTTCCTCACA 
AATGCATGGA TGGCGATCAC GACCACGGTG CAGACGGTTT TTAATGCCAT AAAGACTTTC TTTGAAACGA TCTGGAACGC CATAAAGACT GTTTTTGAGA CCGTGTTCAA TGTGATAAAA ACTATCGTCA CCACCTACTT CAATATCTAC AAGACGATCA TCGAGACCGT CCTGAATGTG ATAAAGGCAG TGGTCACGAC GGTATGGAAC GCAATAAAAA CCGTGGTCAC AACAGTCGTC ACGGCAATCC AGACCTTCAT CACCACGGCT TGGAATGCCA TAAAGACAGC GGTCACTACT GTGATGAATG CCATAAAAAC TGTGGTATCT ACGGTCTGGA ACGGGATAAA GACCACCATC ATGACTGTGG TAAATACCGT GAAAAACGGC ATCACCACAG CCTTCAACGC CATAAAAAAT ACGATATCGA ATGTCCTAAA CGGCATCAAG AATACCGTCT CTAATGTGTT CAACGGGATC TGGAACTTTA TTTCCGGCAT CGTGAACAAG CTGAAGAACG TATTCAACTT CCACTGGGAG CTGCCAAAGA TCAAACTGCC GCACTTCTCC ATCTCCGGGA GCTTCTCTCT AAACCCGCCG TCTATCCCGC ATTTTTCTGT GGAATGGTAC AAGAAGGCGA TGGGAAACGG CATGATCCTC GATTCACCGA CCATCTTCGG CATGAGCGGG AATACGCTCC TTGGCGCAGG AGAAGCCGGT GCGGAGGCTA TTGTCGGTGT TGACTCCCTG CGCGGCATGA TTCAGGATGC AGTGGCCGGA CAGACCTCGG CTATCGTTAC TGCTCTTGCA GGTGTCGGCG GCGGAGGCGA TATCACCATC CCGGTTTATC TTGGAGGCAC GCTGCTTGAC GAGACGATTG TCACAGCTCA GCAGCGAATG GCGCTCCGGT CAGGAGGCAG ATGA
|
Protein sequence | MADRIKGITV EIGGDTTGLS KALSDVNKEI KNTQSQLKDV NKLLKLDPTN TTLLEQKQKL LKQAVSETKD KLTQLKSVQD QMDAGLKNGT VTQQQYDAWQ REIIETENEL RNLEQQCRET DSHISATLKQ TGSKLQEVGG KISSVGTGLT THVTAPIMAI GAASLAAFNE VDAGLDIVAQ KTGATGKALE DMNQIVKDLA TEIPTDFETA GAAVGEVNTR FGLTGQALDD LSAKFIKFAQ LNDTDVSTSI DNVSSVMNAF GMDASEADSL LDALNATGQA TGIDMDTLAG ALSSNAIQLK EMGLTAQQAA GFMGMVEMSG LDTSSAMMGL KTAMKNATKD GKTLDQALAD FSQTMKGNGS ETEKLQAAYD LFGSKAGASI YNAVQTGKLS LDDLAGSLGD FEGSVENTFN ETLDPIDQFK MTMNSLKETG AEIGNTLATV LAPVLKDISA ALKAFAEMWS KIPAPVQQTI VKIALVAAAI GPILVVVGKI ISAVGTIMTI IPQVSAAIGV VKGAMAALNA TMLANPIVLI IAAIAALVAA FIYLWNTNEG FRQFWIDLWE NIKQAVVTAW EAIKSFFSTV WETIKGIFEA AVNGISTFLT NAWMAITTTV QTVFNAIKTF FETIWNAIKT VFETVFNVIK TIVTTYFNIY KTIIETVLNV IKAVVTTVWN AIKTVVTTVV TAIQTFITTA WNAIKTAVTT VMNAIKTVVS TVWNGIKTTI MTVVNTVKNG ITTAFNAIKN TISNVLNGIK NTVSNVFNGI WNFISGIVNK LKNVFNFHWE LPKIKLPHFS ISGSFSLNPP SIPHFSVEWY KKAMGNGMIL DSPTIFGMSG NTLLGAGEAG AEAIVGVDSL RGMIQDAVAG QTSAIVTALA GVGGGGDITI PVYLGGTLLD ETIVTAQQRM ALRSGGR
|
| |
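The protein sequence can be related back to the gene sequence by translating it codon by codon. Below is a minimal sketch in plain Python, assuming that the gene sequence shown is already in coding orientation (it begins with ATG even though the gene lies on the minus strand) and that translation table 11 assigns the same amino acids to codons as the standard code. The `translate` helper is hypothetical and not part of any database tooling. Only the first 30 and the last 24 bases of the gene sequence above are used.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases and last 24 bases of the gene sequence in this record.
head = translate("ATGGCTGACA GGATAAAAGG CATAACCGTG")
tail = translate("GCGCTCCGGT CAGGAGGCAG ATGA")

print(head)  # MADRIKGITV
print(tail)  # ALRSGGR
```

Both fragments match the start and end of the listed protein sequence. The tail fragment is in frame because the full gene length (2724 bp) and the fragment length (24 bases) are both multiples of three.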