Gene Daro_1943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_1943 
Symbol 
ID3567872 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp2094187 
End bp2097045 
Gene Length2859 bp 
Protein Length952 aa 
Translation table11 
GC content56% 
IMG OID637680414 
ProductTPR repeat-containing protein 
Protein accessionYP_285159 
Protein GI71907572 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
[TIGR02917] putative PEP-CTERM system TPR-repeat lipoprotein 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.102548 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCACC CAAAATTACT CAAGCTGTCC GGCCTGGGAC TGATTAGGCA GGGGCTAGGC 
TTTGCCCTTC GCGCATCCGT AGCAGTCACT CTTACTGCTG CTCTTTCTGC TATCGCAGAG
GCAACACCTG AGAAAGCAGC GAGCTATTAC GAAGATGCAC TCCGCAAGTA TGAAAATAAC
GAGATGCCTG CGGCTGTCAT TCAGTTGAAA AACGCCATTC AGCAAGATCA GAAAATGCTT
GCAGCGCACC TGTTGCTCGG CAAAGCACTA CTTAAAAATG GCGATCTGAA GGGGGCTGAA
GCCGCATTTG AGGAGGCGCT CAAGCAGGGC GTCAATCGTG GAGAAGTCGC CCTGCCGCTT
GGGCAGATTT ATCTTGCACT AGGCCGCCCG GAAGCGGTTA TCGAAAAAAT ACCAGCTTCC
GGTTTGCCTC CCGCATTGCA GGTCGAAGTG CTGACCATGC GGGGAAATGC TTACTTGGAA
TCTGGCAAGA GCAGTCTTGC GGTCCAAAGT TTTGAGAATG CCAAGGCCAT TGATCCGAAG
TCGCCGCTGC CGCTCATCGC TGAAGTGCCG ATGTTGTTGG CTGCCGGAAG ACTCGATCAG
GCCAGGGAAA AGGCCAACAA GGCCGTTGAA CTGGCACCCA ATAACGGTTC GGCCTGGAAT
ATCAAAGCCT CGGTGCTGCA CGCTTCGTTT GATGCCAACG GAGCCCTGGC CGCTTATGAC
AAGGCGCTGA CGCTGGCGCC GAAACATGTT GATGCACGCA TCGCTCGTGC AGCATTGTTG
ATCGATCTGA AGCGCAATGC CGATGCTCAA AAAGATCTTG ATTACCTGAA AACCTTTGCC
GAAGATGAGC CTCGTGCGGC TTATCTTCGG GCTGTCTTGG CCAGTCTGCA CGGCGATGCC
AATGCGACAA ACGCAGCGCT TAAAGAGGTG ACGCGGACCG TCGACAGTCT TCCTCCTGCC
TGGTTGGCGC GGCGCGAGCA ATTGTTGATG GCGGCCGCTT TGGCGCATTA CGGCCTGGGT
AGCCACGAAA AGGCGCGTGA GTATCTGGAT GCGCTCATTG CGCGCAGCCC AAGTAATCTG
GGGGCAAAAA AATTACTGGC TTCGATTTAT GCCGATGCAA AGGATTACGG TCGTGCTCAG
ACTTTGCTCG AATCCCTGCA AAGAGCTACG CCGGACGACC CTCAGGTGAT GTACCTGCTG
GGAACGGTCA ATCTCGCCCA ACGACGCTAT GCGCAAGCGA CAGACCTTCT GGAAAAGGCT
GCAACTCGCA CCGGTTCCCC CGACATGAAC CGCTCGCTGG GCTTGAGCCA GCTTGGTCTG
GGACAGGCCG AGAAAGGGCT GGCCAGTCTG GAAAAGGCAT TTTCCGCCAA CCCGGCTGAT
TTTCGTGCTG GGATGGCCCT GGCTACGCTT TACATGCGCC AGGCCAAGAA AGACAAAGCG
ATGAAAACCG CGGAGGCAAT GGTCAAACAG GACTCGGCGA ATCTGACGGC ACTGAATTTT
CTGGGGACGA TCAAGGGCGC AAGTGGAGAC AAAGCAGGGG CGCGTAGTGC CTATTTGCAG
GTGCTTGCCA AAGATGCCGC TTTTGCCCCC TCTGTTCTGA ATCTGGTACG TCTGGACATC
GGTGAAAAGC GCTTCGATGA GGCACGTCGC CGTCTTGATG CTTTATTGAA AAAAGACAGT
AACGATTATC AGGTGCTGTT CGAGTACGGC CTGCTTGAAC AACGGGCTGA GCGCCCGGCC
GAAGCCATTC GTCATCTGAC CAAGGCGGGT GATGTCCAGC GTACTGATCC TGGTCCGACC
TTGGCGTTGA TTGATCTGTA CCTCAATCAA CGCCAAGGAG AACAGGCGCT CAAGGCAGCC
AAGGCCCTGG TTAGCAAGTT CTCGACAAGT CTGAGGGTGC AACAGGCCTT GGCACGGACC
TACCTGGCCA CAGGGGATGC TGTAAATGCC CGTAACGTAC TGACGACCGC AACGCGCCTA
GCTGAATTTG ATCCGAAGGC CCAGGTCTCG ATCGCGCGTA TGCAGTTGGC GGCCGCCAAC
CCTAACGGCG CTGCCTACAG CGTTTCCAAG GCCCTTCAGG GAAATCCGGA TGATGTCGCT
GCGCTCGCGC TGGCGGTCCA GGTTGAAGCG CGGCGCGGTG ATTCAGGCAA GGCTGATGTG
GCCCTCAGGA CGTTGACTTC AAAGCACCCG AACGACGTCG AGACGATTCG TGCCGGCGCA
GAATTGGCCA TGATGCGCGG CCAGTATCAG GCTGCAGTCA CCGGCTATCG CAAGCTTGTG
GCTCGTGAAG AAACCAGCGG CAATGCCCTG GCTTTGGTCG ACGCGCAGAC GAGGGCGGGT
GAGTCAGGAA AAGCTTCGGC CTTCCTCGAG GCCTGGGTAA AAACCCACCC CGAGGACCAG
CGGGCCCAGA AAGCGCTGGC CGATATCTTG TTCCGTGTGG GACAGCTTCC TGTGGCGAAA
CAGGCTTATC AAAAACTGCT TGCAGCCAAC CCTGATGATG CTGTTTCGCT AAATAACTAC
GCCAACTTGC TCTTGCAAAT GAACGATCCT TCTGCCCAGC AAGTCGCGGA AAAGGCGATC
AATCTTTCAC CTAATCATCC GGCCTATGCC GATACCCTGG GCTGGATTCT GGTGCACAAG
GAGCAACTTG AAACTGGGCT GAGGTACTTG CGAGAAGCAC GCTTGCGTAG CCCTGAAAAT
GGCGATATTC GCTTCCATCT TGCCTATGCC TTGGCCAAGG CAGGCAGACG CGATGAGGCG
AAGGAGGAAT TGCGGGCTGC CATCGCTAGC TCCGGCGAGT TCAAAGGGAC TGCTTTATTC
GGGCAACTTC GGCGGGAGTT GGGGGTAGTC AATGAATAG
 
Protein sequence
MMHPKLLKLS GLGLIRQGLG FALRASVAVT LTAALSAIAE ATPEKAASYY EDALRKYENN 
EMPAAVIQLK NAIQQDQKML AAHLLLGKAL LKNGDLKGAE AAFEEALKQG VNRGEVALPL
GQIYLALGRP EAVIEKIPAS GLPPALQVEV LTMRGNAYLE SGKSSLAVQS FENAKAIDPK
SPLPLIAEVP MLLAAGRLDQ AREKANKAVE LAPNNGSAWN IKASVLHASF DANGALAAYD
KALTLAPKHV DARIARAALL IDLKRNADAQ KDLDYLKTFA EDEPRAAYLR AVLASLHGDA
NATNAALKEV TRTVDSLPPA WLARREQLLM AAALAHYGLG SHEKAREYLD ALIARSPSNL
GAKKLLASIY ADAKDYGRAQ TLLESLQRAT PDDPQVMYLL GTVNLAQRRY AQATDLLEKA
ATRTGSPDMN RSLGLSQLGL GQAEKGLASL EKAFSANPAD FRAGMALATL YMRQAKKDKA
MKTAEAMVKQ DSANLTALNF LGTIKGASGD KAGARSAYLQ VLAKDAAFAP SVLNLVRLDI
GEKRFDEARR RLDALLKKDS NDYQVLFEYG LLEQRAERPA EAIRHLTKAG DVQRTDPGPT
LALIDLYLNQ RQGEQALKAA KALVSKFSTS LRVQQALART YLATGDAVNA RNVLTTATRL
AEFDPKAQVS IARMQLAAAN PNGAAYSVSK ALQGNPDDVA ALALAVQVEA RRGDSGKADV
ALRTLTSKHP NDVETIRAGA ELAMMRGQYQ AAVTGYRKLV AREETSGNAL ALVDAQTRAG
ESGKASAFLE AWVKTHPEDQ RAQKALADIL FRVGQLPVAK QAYQKLLAAN PDDAVSLNNY
ANLLLQMNDP SAQQVAEKAI NLSPNHPAYA DTLGWILVHK EQLETGLRYL REARLRSPEN
GDIRFHLAYA LAKAGRRDEA KEELRAAIAS SGEFKGTALF GQLRRELGVV NE