Gene Information |
Locus tag | Daro_2439 |
Symbol | |
ID | 3568382 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dechloromonas aromatica RCB |
Kingdom | Bacteria |
Replicon accession | NC_007298 |
Strand | + |
Start bp | 2638062 |
End bp | 2640833 |
Gene Length | 2772 bp |
Protein Length | 923 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637680905 |
Product | TPR repeat-containing protein |
Protein accession | YP_285644 |
Protein GI | 71908057 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | [TIGR02917] putative PEP-CTERM system TPR-repeat lipoprotein |
|
|
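The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length_bp = gene_length(2638062, 2640833)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the result agrees with the listed Gene Length (2772 bp) and Protein Length (923 aa).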
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 0.427579 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAA ACCGTTTCTT CATTACGTCA GCTCTCACTA CTGCACTGCT CGCTGCATTC CTTGGGGGCT GCGGCGACTC CCCTGAATCG CTCATCGCCT CAAGCCGAGA GTTTCTCGCC AAGAATGACA ATAAGGCGGC TGTTATTCAG TTAAAAAATG CCCTTCAACA AAATCCCAGT CTTGGCGAAG CACGCTTTTT GCTCGGGAAA ACGCTTCTTG AGACTGGCGA TGCAGCGGGT GCGGAAGTTG AGCTCCGCAA GGCCCAAGAT CTGAAATACT CCCCAGAACA AACAACTCCT CTTCTGGCCA AGGCGATGCT TGGCGCTGGC CAAGCAAAGA AGTTAATCGA CGAATTCGGA AAAACTGATT CGCTTAGCGG GGAATCTCTT GCTGCGCTCA AGACGACCTT GAGCGTTGCT TACCTCATCC AAGGCAACCA AGATGCTGCT CAATCAGCAC TTTCCGATGC TCTAAAGGCA CAACCGGACT TCGCTCCTGC CCTACTTTCC CTGGCTCGTA GCAAAGTCAC AAATCGAGAC ATTGATGGCG CCCAAGCACT CGTCGCCCAA GTCTTGGAGA AGAACCCAAA AAATCACGAC GCTTTGCTGT TGAATGGCTC CCTGCAAGGA GTCAAGTCGG GTCCCGAAGC CGCCTTAGCT GAATATCGCA AGGCAATCGA GGCCAAGCCT GACTTTATTG CCGGGCACGC TGCGCTCATT ACCACGCTAT TCCAGCAACA GAAATTCGAC GAGGCATCCA CCCAGTTGGA TGCCTTGAGG AAAATTGCGC CGAAACATCC GCAAACGCTC TATCTGGATG CCCAAGCCAG CTACCAGCGG AAGGATTTCC AAGGCGCACG CTCAAAACTT CAAGATCTAC TCAAGTTCAA CACAAACAAT CCAACTGCAT TACAGCTTGC GGGTGCCGTC GAATTCCAAC TACGTTCGTA TATGCAGGCG GAAACTTACC TGAATAAAGC CCTCTCGCAA GCCCCTGAAT TGCGCTTAGC ACGACGCATT TTAGTTGCCA CCTATCTTCG TAACGGACAA GCAGCGAAAG CTCTGAACAC GCTCCAACCG ATGCTTGACA AGGCAGATAC TGATTCAGCA CTACTAACGC TGGCAGGAGA GACCTATTTG CAAAATGGGG ATGCCAAAAA AGCCGAAGAG TATTTTGCAA AGGCCAGCAA GCTTGATCCG AACGACCCCG GGAAGAAAAC GTCAATTGCT TTGACCCACT TGGCTCAGGG CGATGTTTCT GGTGCAGTTG AGGATCTGGA ACAAATTGCC CAAACCGATA AAGGCGTGAG GGCAGACCTC GCGTTGATCT CCACTTTTAT CCGTACCAAC CAAGCCGACA AGGCGCTGAA AGCAATCGAC AGCCTTGAAA AGAAACAGCC TGACAACCCT GCAACGCATA ACTTGCGCGC CCAGACGCTG TTGTTAAAGA AAGATCTGGC TGGCGCCCGC TCAAGCTATG AGGCTGCGTT AAAGATCAAT CCGGCCTTCT TCCCCGCAGC CGCCAGTCTC GCGAAAATCG ATCTGGCAGA AAAGAAGCCA GACGACGCAA AAAAACGCTT TGAAAACGTA CTCCTTGCCG ACCCAAAGAG CGTCCCCGCG TTACTTGCCC TTGCTGAATT AAAAGCTGCG AACAAAGGTT CTGTTGATGA AGTTGCCGGC TTAATCGGGA AAGCCATCAC GAGCAATCCA ACAGAAATCA GCCCACGGCT CGCTCTGATC CAGTATTACC TTAGCCAAAA AGAGACCAAA AAGGCCTTGG GCGCTGCGAA TGACGCCGCT 
GCAGCAATTA AAGACAAGCC AGAAATCATA GATGCCCTCG GTCGTACCCA GCAGATGGCC GGAGACCTGA ATCAGGCTTT GGCAAGCTAC ACCAAGCTCG CAGCACTTCA GCCAGCCTCA CCATTGCCCT TGGTAAGGAC AGCAGACGTT CATCTCGCGA ACAAGAATAA GGATGAAGCA GCAAAGAGTC TAAAAAAGGC ACTTGAGATC AAATCGGACC TGGTTGAAGC GCAGCGAGGT CTAATTTTGT TGGCACTGGA CGCAAAAAAG CCGAACGAAG CGCTGCAGAT TGCCCAACAG ATTGAAAAAC AACGCCCCAA GGAAGCCGTT GGTTTTGCGC TAGAAGGAGA CATCCACGCC ACATCAAAGA ATTGGACTGA GGCGGCAAAT GCCTATCGTA ATGCAATAAA ACGAGCCGGC AGCGCAGACA TTGCCATCAA GCTTCACTCC GTATTACTGG CATCGGGTAA CCCCCAAGAA GCAGGACGGA TGGCAGCAGC ATGGCAGAAA GAACACCCCA AAGACATTGC ACTGCTCGTT CATCAAGGGG ACGTAGCGAC CGCTCGCAAG GACTATTCAT TGGCGGCACA ACATTATCGT CAAGCGCTCG ACATCCAGCC AAACAATGCG CTTGTCCTGA ACAATTTAGC CTGGGTATCC GGGGAACTCA AAGCACCAAA AGCAATCGAG TATGCAGAAA AGGCCAACCA ACTGGCGCCA GGGCAACCAC AATTCATGGA CACCCTCGCA ATGCTACTTG CGCAAAAAGG CGAAACCAAA CGGGCCATTG ACCTCTTACG TAATGCTATG AACGCCGCGC CAAACGCCGC ATCGATTCAG CTCAACCTCG CGAAGGTATT GATATCCGCC GGCGAGAAAA AAGAGGCCCG CAAGGAGCTT GAGGCCCTTG CGAAGCTAGG CGACAAATTC TCCGGCCAAC CGGAGGTTGC TAAACTGCTG CAGGCTCTTT GA
|
Protein sequence | MKKNRFFITS ALTTALLAAF LGGCGDSPES LIASSREFLA KNDNKAAVIQ LKNALQQNPS LGEARFLLGK TLLETGDAAG AEVELRKAQD LKYSPEQTTP LLAKAMLGAG QAKKLIDEFG KTDSLSGESL AALKTTLSVA YLIQGNQDAA QSALSDALKA QPDFAPALLS LARSKVTNRD IDGAQALVAQ VLEKNPKNHD ALLLNGSLQG VKSGPEAALA EYRKAIEAKP DFIAGHAALI TTLFQQQKFD EASTQLDALR KIAPKHPQTL YLDAQASYQR KDFQGARSKL QDLLKFNTNN PTALQLAGAV EFQLRSYMQA ETYLNKALSQ APELRLARRI LVATYLRNGQ AAKALNTLQP MLDKADTDSA LLTLAGETYL QNGDAKKAEE YFAKASKLDP NDPGKKTSIA LTHLAQGDVS GAVEDLEQIA QTDKGVRADL ALISTFIRTN QADKALKAID SLEKKQPDNP ATHNLRAQTL LLKKDLAGAR SSYEAALKIN PAFFPAAASL AKIDLAEKKP DDAKKRFENV LLADPKSVPA LLALAELKAA NKGSVDEVAG LIGKAITSNP TEISPRLALI QYYLSQKETK KALGAANDAA AAIKDKPEII DALGRTQQMA GDLNQALASY TKLAALQPAS PLPLVRTADV HLANKNKDEA AKSLKKALEI KSDLVEAQRG LILLALDAKK PNEALQIAQQ IEKQRPKEAV GFALEGDIHA TSKNWTEAAN AYRNAIKRAG SADIAIKLHS VLLASGNPQE AGRMAAAWQK EHPKDIALLV HQGDVATARK DYSLAAQHYR QALDIQPNNA LVLNNLAWVS GELKAPKAIE YAEKANQLAP GQPQFMDTLA MLLAQKGETK RAIDLLRNAM NAAPNAASIQ LNLAKVLISA GEKKEARKEL EALAKLGDKF SGQPEVAKLL QAL
|
| |
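As a quick sanity check that the gene and protein sequences correspond, the first 30 nucleotides can be translated and compared with the first 10 residues. This is a minimal sketch: the `CODONS` dictionary is a deliberately partial codon table covering only the codons in that prefix (these assignments are the same in translation table 11 and the standard code), and `translate_prefix` is a hypothetical helper name.

```python
# Partial codon table: only the codons that occur in the first 30 nt of this gene.
CODONS = {
    "ATG": "M", "AAA": "K", "AAC": "N", "CGT": "R",
    "TTC": "F", "ATT": "I", "ACG": "T", "TCA": "S",
}


def translate_prefix(dna_blocks: str) -> str:
    """Translate a space-grouped DNA string codon by codon, ignoring any trailing partial codon."""
    dna = dna_blocks.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-nt blocks of the gene sequence and first 10 residues of the protein.
gene_prefix = "ATGAAAAAAA ACCGTTTCTT CATTACGTCA"
protein_prefix = "MKKNRFFITS"

print(translate_prefix(gene_prefix) == protein_prefix)
```

Translating the full sequence would need a complete table 11 codon map; the partial table here raises a `KeyError` on any codon outside the prefix.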