Gene Daro_2439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_2439 
Symbol 
ID3568382 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp2638062 
End bp2640833 
Gene Length2772 bp 
Protein Length923 aa 
Translation table11 
GC content52% 
IMG OID637680905 
ProductTPR repeat-containing protein 
Protein accessionYP_285644 
Protein GI71908057 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID[TIGR02917] putative PEP-CTERM system TPR-repeat lipoprotein 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.427579 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAA ACCGTTTCTT CATTACGTCA GCTCTCACTA CTGCACTGCT CGCTGCATTC 
CTTGGGGGCT GCGGCGACTC CCCTGAATCG CTCATCGCCT CAAGCCGAGA GTTTCTCGCC
AAGAATGACA ATAAGGCGGC TGTTATTCAG TTAAAAAATG CCCTTCAACA AAATCCCAGT
CTTGGCGAAG CACGCTTTTT GCTCGGGAAA ACGCTTCTTG AGACTGGCGA TGCAGCGGGT
GCGGAAGTTG AGCTCCGCAA GGCCCAAGAT CTGAAATACT CCCCAGAACA AACAACTCCT
CTTCTGGCCA AGGCGATGCT TGGCGCTGGC CAAGCAAAGA AGTTAATCGA CGAATTCGGA
AAAACTGATT CGCTTAGCGG GGAATCTCTT GCTGCGCTCA AGACGACCTT GAGCGTTGCT
TACCTCATCC AAGGCAACCA AGATGCTGCT CAATCAGCAC TTTCCGATGC TCTAAAGGCA
CAACCGGACT TCGCTCCTGC CCTACTTTCC CTGGCTCGTA GCAAAGTCAC AAATCGAGAC
ATTGATGGCG CCCAAGCACT CGTCGCCCAA GTCTTGGAGA AGAACCCAAA AAATCACGAC
GCTTTGCTGT TGAATGGCTC CCTGCAAGGA GTCAAGTCGG GTCCCGAAGC CGCCTTAGCT
GAATATCGCA AGGCAATCGA GGCCAAGCCT GACTTTATTG CCGGGCACGC TGCGCTCATT
ACCACGCTAT TCCAGCAACA GAAATTCGAC GAGGCATCCA CCCAGTTGGA TGCCTTGAGG
AAAATTGCGC CGAAACATCC GCAAACGCTC TATCTGGATG CCCAAGCCAG CTACCAGCGG
AAGGATTTCC AAGGCGCACG CTCAAAACTT CAAGATCTAC TCAAGTTCAA CACAAACAAT
CCAACTGCAT TACAGCTTGC GGGTGCCGTC GAATTCCAAC TACGTTCGTA TATGCAGGCG
GAAACTTACC TGAATAAAGC CCTCTCGCAA GCCCCTGAAT TGCGCTTAGC ACGACGCATT
TTAGTTGCCA CCTATCTTCG TAACGGACAA GCAGCGAAAG CTCTGAACAC GCTCCAACCG
ATGCTTGACA AGGCAGATAC TGATTCAGCA CTACTAACGC TGGCAGGAGA GACCTATTTG
CAAAATGGGG ATGCCAAAAA AGCCGAAGAG TATTTTGCAA AGGCCAGCAA GCTTGATCCG
AACGACCCCG GGAAGAAAAC GTCAATTGCT TTGACCCACT TGGCTCAGGG CGATGTTTCT
GGTGCAGTTG AGGATCTGGA ACAAATTGCC CAAACCGATA AAGGCGTGAG GGCAGACCTC
GCGTTGATCT CCACTTTTAT CCGTACCAAC CAAGCCGACA AGGCGCTGAA AGCAATCGAC
AGCCTTGAAA AGAAACAGCC TGACAACCCT GCAACGCATA ACTTGCGCGC CCAGACGCTG
TTGTTAAAGA AAGATCTGGC TGGCGCCCGC TCAAGCTATG AGGCTGCGTT AAAGATCAAT
CCGGCCTTCT TCCCCGCAGC CGCCAGTCTC GCGAAAATCG ATCTGGCAGA AAAGAAGCCA
GACGACGCAA AAAAACGCTT TGAAAACGTA CTCCTTGCCG ACCCAAAGAG CGTCCCCGCG
TTACTTGCCC TTGCTGAATT AAAAGCTGCG AACAAAGGTT CTGTTGATGA AGTTGCCGGC
TTAATCGGGA AAGCCATCAC GAGCAATCCA ACAGAAATCA GCCCACGGCT CGCTCTGATC
CAGTATTACC TTAGCCAAAA AGAGACCAAA AAGGCCTTGG GCGCTGCGAA TGACGCCGCT
GCAGCAATTA AAGACAAGCC AGAAATCATA GATGCCCTCG GTCGTACCCA GCAGATGGCC
GGAGACCTGA ATCAGGCTTT GGCAAGCTAC ACCAAGCTCG CAGCACTTCA GCCAGCCTCA
CCATTGCCCT TGGTAAGGAC AGCAGACGTT CATCTCGCGA ACAAGAATAA GGATGAAGCA
GCAAAGAGTC TAAAAAAGGC ACTTGAGATC AAATCGGACC TGGTTGAAGC GCAGCGAGGT
CTAATTTTGT TGGCACTGGA CGCAAAAAAG CCGAACGAAG CGCTGCAGAT TGCCCAACAG
ATTGAAAAAC AACGCCCCAA GGAAGCCGTT GGTTTTGCGC TAGAAGGAGA CATCCACGCC
ACATCAAAGA ATTGGACTGA GGCGGCAAAT GCCTATCGTA ATGCAATAAA ACGAGCCGGC
AGCGCAGACA TTGCCATCAA GCTTCACTCC GTATTACTGG CATCGGGTAA CCCCCAAGAA
GCAGGACGGA TGGCAGCAGC ATGGCAGAAA GAACACCCCA AAGACATTGC ACTGCTCGTT
CATCAAGGGG ACGTAGCGAC CGCTCGCAAG GACTATTCAT TGGCGGCACA ACATTATCGT
CAAGCGCTCG ACATCCAGCC AAACAATGCG CTTGTCCTGA ACAATTTAGC CTGGGTATCC
GGGGAACTCA AAGCACCAAA AGCAATCGAG TATGCAGAAA AGGCCAACCA ACTGGCGCCA
GGGCAACCAC AATTCATGGA CACCCTCGCA ATGCTACTTG CGCAAAAAGG CGAAACCAAA
CGGGCCATTG ACCTCTTACG TAATGCTATG AACGCCGCGC CAAACGCCGC ATCGATTCAG
CTCAACCTCG CGAAGGTATT GATATCCGCC GGCGAGAAAA AAGAGGCCCG CAAGGAGCTT
GAGGCCCTTG CGAAGCTAGG CGACAAATTC TCCGGCCAAC CGGAGGTTGC TAAACTGCTG
CAGGCTCTTT GA
 
Protein sequence
MKKNRFFITS ALTTALLAAF LGGCGDSPES LIASSREFLA KNDNKAAVIQ LKNALQQNPS 
LGEARFLLGK TLLETGDAAG AEVELRKAQD LKYSPEQTTP LLAKAMLGAG QAKKLIDEFG
KTDSLSGESL AALKTTLSVA YLIQGNQDAA QSALSDALKA QPDFAPALLS LARSKVTNRD
IDGAQALVAQ VLEKNPKNHD ALLLNGSLQG VKSGPEAALA EYRKAIEAKP DFIAGHAALI
TTLFQQQKFD EASTQLDALR KIAPKHPQTL YLDAQASYQR KDFQGARSKL QDLLKFNTNN
PTALQLAGAV EFQLRSYMQA ETYLNKALSQ APELRLARRI LVATYLRNGQ AAKALNTLQP
MLDKADTDSA LLTLAGETYL QNGDAKKAEE YFAKASKLDP NDPGKKTSIA LTHLAQGDVS
GAVEDLEQIA QTDKGVRADL ALISTFIRTN QADKALKAID SLEKKQPDNP ATHNLRAQTL
LLKKDLAGAR SSYEAALKIN PAFFPAAASL AKIDLAEKKP DDAKKRFENV LLADPKSVPA
LLALAELKAA NKGSVDEVAG LIGKAITSNP TEISPRLALI QYYLSQKETK KALGAANDAA
AAIKDKPEII DALGRTQQMA GDLNQALASY TKLAALQPAS PLPLVRTADV HLANKNKDEA
AKSLKKALEI KSDLVEAQRG LILLALDAKK PNEALQIAQQ IEKQRPKEAV GFALEGDIHA
TSKNWTEAAN AYRNAIKRAG SADIAIKLHS VLLASGNPQE AGRMAAAWQK EHPKDIALLV
HQGDVATARK DYSLAAQHYR QALDIQPNNA LVLNNLAWVS GELKAPKAIE YAEKANQLAP
GQPQFMDTLA MLLAQKGETK RAIDLLRNAM NAAPNAASIQ LNLAKVLISA GEKKEARKEL
EALAKLGDKF SGQPEVAKLL QAL