Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Daro_1943 |
Symbol | |
ID | 3567872 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dechloromonas aromatica RCB |
Kingdom | Bacteria |
Replicon accession | NC_007298 |
Strand | + |
Start bp | 2094187 |
End bp | 2097045 |
Gene Length | 2859 bp |
Protein Length | 952 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637680414 |
Product | TPR repeat-containing protein |
Protein accession | YP_285159 |
Protein GI | 71907572 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type [TIGR02917] putative PEP-CTERM system TPR-repeat lipoprotein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.102548 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCACC CAAAATTACT CAAGCTGTCC GGCCTGGGAC TGATTAGGCA GGGGCTAGGC TTTGCCCTTC GCGCATCCGT AGCAGTCACT CTTACTGCTG CTCTTTCTGC TATCGCAGAG GCAACACCTG AGAAAGCAGC GAGCTATTAC GAAGATGCAC TCCGCAAGTA TGAAAATAAC GAGATGCCTG CGGCTGTCAT TCAGTTGAAA AACGCCATTC AGCAAGATCA GAAAATGCTT GCAGCGCACC TGTTGCTCGG CAAAGCACTA CTTAAAAATG GCGATCTGAA GGGGGCTGAA GCCGCATTTG AGGAGGCGCT CAAGCAGGGC GTCAATCGTG GAGAAGTCGC CCTGCCGCTT GGGCAGATTT ATCTTGCACT AGGCCGCCCG GAAGCGGTTA TCGAAAAAAT ACCAGCTTCC GGTTTGCCTC CCGCATTGCA GGTCGAAGTG CTGACCATGC GGGGAAATGC TTACTTGGAA TCTGGCAAGA GCAGTCTTGC GGTCCAAAGT TTTGAGAATG CCAAGGCCAT TGATCCGAAG TCGCCGCTGC CGCTCATCGC TGAAGTGCCG ATGTTGTTGG CTGCCGGAAG ACTCGATCAG GCCAGGGAAA AGGCCAACAA GGCCGTTGAA CTGGCACCCA ATAACGGTTC GGCCTGGAAT ATCAAAGCCT CGGTGCTGCA CGCTTCGTTT GATGCCAACG GAGCCCTGGC CGCTTATGAC AAGGCGCTGA CGCTGGCGCC GAAACATGTT GATGCACGCA TCGCTCGTGC AGCATTGTTG ATCGATCTGA AGCGCAATGC CGATGCTCAA AAAGATCTTG ATTACCTGAA AACCTTTGCC GAAGATGAGC CTCGTGCGGC TTATCTTCGG GCTGTCTTGG CCAGTCTGCA CGGCGATGCC AATGCGACAA ACGCAGCGCT TAAAGAGGTG ACGCGGACCG TCGACAGTCT TCCTCCTGCC TGGTTGGCGC GGCGCGAGCA ATTGTTGATG GCGGCCGCTT TGGCGCATTA CGGCCTGGGT AGCCACGAAA AGGCGCGTGA GTATCTGGAT GCGCTCATTG CGCGCAGCCC AAGTAATCTG GGGGCAAAAA AATTACTGGC TTCGATTTAT GCCGATGCAA AGGATTACGG TCGTGCTCAG ACTTTGCTCG AATCCCTGCA AAGAGCTACG CCGGACGACC CTCAGGTGAT GTACCTGCTG GGAACGGTCA ATCTCGCCCA ACGACGCTAT GCGCAAGCGA CAGACCTTCT GGAAAAGGCT GCAACTCGCA CCGGTTCCCC CGACATGAAC CGCTCGCTGG GCTTGAGCCA GCTTGGTCTG GGACAGGCCG AGAAAGGGCT GGCCAGTCTG GAAAAGGCAT TTTCCGCCAA CCCGGCTGAT TTTCGTGCTG GGATGGCCCT GGCTACGCTT TACATGCGCC AGGCCAAGAA AGACAAAGCG ATGAAAACCG CGGAGGCAAT GGTCAAACAG GACTCGGCGA ATCTGACGGC ACTGAATTTT CTGGGGACGA TCAAGGGCGC AAGTGGAGAC AAAGCAGGGG CGCGTAGTGC CTATTTGCAG GTGCTTGCCA AAGATGCCGC TTTTGCCCCC TCTGTTCTGA ATCTGGTACG TCTGGACATC GGTGAAAAGC GCTTCGATGA GGCACGTCGC CGTCTTGATG CTTTATTGAA AAAAGACAGT AACGATTATC AGGTGCTGTT CGAGTACGGC CTGCTTGAAC AACGGGCTGA GCGCCCGGCC GAAGCCATTC GTCATCTGAC CAAGGCGGGT GATGTCCAGC GTACTGATCC TGGTCCGACC TTGGCGTTGA TTGATCTGTA CCTCAATCAA CGCCAAGGAG AACAGGCGCT CAAGGCAGCC AAGGCCCTGG TTAGCAAGTT CTCGACAAGT CTGAGGGTGC AACAGGCCTT GGCACGGACC TACCTGGCCA CAGGGGATGC TGTAAATGCC CGTAACGTAC TGACGACCGC AACGCGCCTA GCTGAATTTG ATCCGAAGGC CCAGGTCTCG ATCGCGCGTA TGCAGTTGGC GGCCGCCAAC CCTAACGGCG CTGCCTACAG CGTTTCCAAG GCCCTTCAGG GAAATCCGGA TGATGTCGCT GCGCTCGCGC TGGCGGTCCA GGTTGAAGCG CGGCGCGGTG ATTCAGGCAA GGCTGATGTG GCCCTCAGGA CGTTGACTTC AAAGCACCCG AACGACGTCG AGACGATTCG TGCCGGCGCA GAATTGGCCA TGATGCGCGG CCAGTATCAG GCTGCAGTCA CCGGCTATCG CAAGCTTGTG GCTCGTGAAG AAACCAGCGG CAATGCCCTG GCTTTGGTCG ACGCGCAGAC GAGGGCGGGT GAGTCAGGAA AAGCTTCGGC CTTCCTCGAG GCCTGGGTAA AAACCCACCC CGAGGACCAG CGGGCCCAGA AAGCGCTGGC CGATATCTTG TTCCGTGTGG GACAGCTTCC TGTGGCGAAA CAGGCTTATC AAAAACTGCT TGCAGCCAAC CCTGATGATG CTGTTTCGCT AAATAACTAC GCCAACTTGC TCTTGCAAAT GAACGATCCT TCTGCCCAGC AAGTCGCGGA AAAGGCGATC AATCTTTCAC CTAATCATCC GGCCTATGCC GATACCCTGG GCTGGATTCT GGTGCACAAG GAGCAACTTG AAACTGGGCT GAGGTACTTG CGAGAAGCAC GCTTGCGTAG CCCTGAAAAT GGCGATATTC GCTTCCATCT TGCCTATGCC TTGGCCAAGG CAGGCAGACG CGATGAGGCG AAGGAGGAAT TGCGGGCTGC CATCGCTAGC TCCGGCGAGT TCAAAGGGAC TGCTTTATTC GGGCAACTTC GGCGGGAGTT GGGGGTAGTC AATGAATAG
|
Protein sequence | MMHPKLLKLS GLGLIRQGLG FALRASVAVT LTAALSAIAE ATPEKAASYY EDALRKYENN EMPAAVIQLK NAIQQDQKML AAHLLLGKAL LKNGDLKGAE AAFEEALKQG VNRGEVALPL GQIYLALGRP EAVIEKIPAS GLPPALQVEV LTMRGNAYLE SGKSSLAVQS FENAKAIDPK SPLPLIAEVP MLLAAGRLDQ AREKANKAVE LAPNNGSAWN IKASVLHASF DANGALAAYD KALTLAPKHV DARIARAALL IDLKRNADAQ KDLDYLKTFA EDEPRAAYLR AVLASLHGDA NATNAALKEV TRTVDSLPPA WLARREQLLM AAALAHYGLG SHEKAREYLD ALIARSPSNL GAKKLLASIY ADAKDYGRAQ TLLESLQRAT PDDPQVMYLL GTVNLAQRRY AQATDLLEKA ATRTGSPDMN RSLGLSQLGL GQAEKGLASL EKAFSANPAD FRAGMALATL YMRQAKKDKA MKTAEAMVKQ DSANLTALNF LGTIKGASGD KAGARSAYLQ VLAKDAAFAP SVLNLVRLDI GEKRFDEARR RLDALLKKDS NDYQVLFEYG LLEQRAERPA EAIRHLTKAG DVQRTDPGPT LALIDLYLNQ RQGEQALKAA KALVSKFSTS LRVQQALART YLATGDAVNA RNVLTTATRL AEFDPKAQVS IARMQLAAAN PNGAAYSVSK ALQGNPDDVA ALALAVQVEA RRGDSGKADV ALRTLTSKHP NDVETIRAGA ELAMMRGQYQ AAVTGYRKLV AREETSGNAL ALVDAQTRAG ESGKASAFLE AWVKTHPEDQ RAQKALADIL FRVGQLPVAK QAYQKLLAAN PDDAVSLNNY ANLLLQMNDP SAQQVAEKAI NLSPNHPAYA DTLGWILVHK EQLETGLRYL REARLRSPEN GDIRFHLAYA LAKAGRRDEA KEELRAAIAS SGEFKGTALF GQLRRELGVV NE
|
| |