Gene Daro_2519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_2519 
Symbol 
ID3567553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp2716080 
End bp2718017 
Gene Length1938 bp 
Protein Length645 aa 
Translation table11 
GC content61% 
IMG OID637680986 
Producttransglutaminase-like 
Protein accessionYP_285722 
Protein GI71908135 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.135415 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCC AAGCCAGCGA AGCGCTCGAC CGCCGCGCCA CACCATGGTT GTTCGCCACG 
GCACTGGTCA CCACGGCACC GCACATGCTG CATCAACCAC CGTGGCTCAG CGCTCTGGCC
GGGATGCTGC TGCTCTGGGC CACCTGGCTA TGGTGGCGGG ACCAGCGCCT GCCGGGACGC
TGGATACTCC TGCCGCTGGT CGGCGCCGGC TGCGCCGGAA TCCTCATTGA ATTCCACACC
CTGTTCGGCC GTGACGCCGG TGTCGCCATG CTGGTCATCT TCACGACCAT GAAGTTGCTC
GAATTGAAAT CGCGGCGCGA TGCAATGGTG CTTGTCACGC TCGGTTATTT CCTGCTGCTG
ACCCACTACT TATACTCACA GAGCATTCCC ACCGGTCTCT GGCTGCTGGC CTGCCTGTGG
CTGGTCACCG CTACATTGAT CCGCCTGCAT GGCGGCCCGG CCAGCAACCT TCGCCACAGC
TTGCGCTATG CAGCCCTGCT CTGCCTGCAG GCAGTTCCAT TCATGTTGGC GCTGTACCTA
TTATTCCCCC GCATTTCAGG ACCGCTCTGG GGGCTACCGT CAGATGCCCA CGCCGGCATG
ACCGGACTAT CGGACACTAT GTCGCCCGGT AGCTTCTCCC AACTTGCTCA AAGCGCTGAT
ATCGCCTTTC GCGTTCGCTT CGACGGACCA CTCCCCCCCA AGCAGAAGCT CTACTGGCGC
GGCCCGGTCA TGGAAAACTT CGACGGCACA ACCTGGCGTC GCCATGAAGG CAGCCAGCCG
CCTGAACGGG TCGAAAGCCT GTCACTGCCA ATCCCTTATG AAACCACGCT GGAAGCCCAC
AACCAGCGCT GGTTGCTGGC TCTTGAAGCA CCAACCGGCC TGCCACCTGA AACCACACTG
AACGGCACCC TGAGTGCAAG CAGCCGCGGC GCAATCACCG AACGACAACG TTTCAGGCTT
TCCGCAACGC TGGATTACCG CTTCAACAGC ACCGAAGACC CACGGATCAT CCAGCGCAAT
CTTGCCTTAC CCGAGGGACT CAACCCGAAA ACCCGCGCCC TGGCCAAGCA ATGGCAAGCC
TCGGGCATTC CACAGCAAGC CATTATTGAC AAGGCACTTG ACCTGTTCGC CAGCGAATTC
ACCTACACCT TGCGACCGCC ACTACTTGGT CGGAACGGCA TCGACGAATT CCTTTTCCAA
AGTCATCGTG GCTTCTGCGA ACACTACGCA GCGGCCTTCA TCATCCTGAT GCGGTCCGCC
GGCATACCGG CCCGTGTTGT TGGTGGTTAT CAGGGCGGCG AATACAACCC GCTCGACGGC
TATCTCGTCG TCCGTCAGTC CGACGCCCAC GCCTGGGCAG AAGTCTGGAT CAGCGGTCGA
GGCTGGATTC GCGTTGATCC GACGGCTGCT GTTTCACCAA ACCGTATTGA AACCGGCATC
GCCGATGCCC TGCCCTTTGG CGAGCCCTTG CCTGCACTTG TCCAGTGGCG GGCCGAGTGG
ATACGCGGCC TGCGCTATCG CTGGGAGGCC ATCAACAACA CCTGGAATCA GAACGTTCTC
GGCTATGACC CGCAGCGCCA GCGCGAGTTG CTTTCCCGCC TGGGTCTAGC CGATACCGAT
TGGCGCAGCC TCGTCACCCT ATTGGGAATA ATTTGCAGCC TGCTGGTCGC CGCCATAACG
GCCTGGACGA TCTATCAACG CCCGCCACAG GATCCTGCGT TACGACTCTG GCACAAGGCC
CTGCGACAGC TTGCCCGAAG ACAGGTAGAC TGCGCGCCTT GGGAAACACC ACTGGCATTG
GCCCGACGCG TCAGCGAACA ACACCCCGAA CTGGCTGATG CCTTCCAGCG CGTGGCCGAG
GCCTATCTGC AAGCACGCTA CGGCCGCTCT GACAACAACC TGAAAACCCT GCGCGAAGCA
ATCGCGCAAT TGCGATGA
 
Protein sequence
MSTQASEALD RRATPWLFAT ALVTTAPHML HQPPWLSALA GMLLLWATWL WWRDQRLPGR 
WILLPLVGAG CAGILIEFHT LFGRDAGVAM LVIFTTMKLL ELKSRRDAMV LVTLGYFLLL
THYLYSQSIP TGLWLLACLW LVTATLIRLH GGPASNLRHS LRYAALLCLQ AVPFMLALYL
LFPRISGPLW GLPSDAHAGM TGLSDTMSPG SFSQLAQSAD IAFRVRFDGP LPPKQKLYWR
GPVMENFDGT TWRRHEGSQP PERVESLSLP IPYETTLEAH NQRWLLALEA PTGLPPETTL
NGTLSASSRG AITERQRFRL SATLDYRFNS TEDPRIIQRN LALPEGLNPK TRALAKQWQA
SGIPQQAIID KALDLFASEF TYTLRPPLLG RNGIDEFLFQ SHRGFCEHYA AAFIILMRSA
GIPARVVGGY QGGEYNPLDG YLVVRQSDAH AWAEVWISGR GWIRVDPTAA VSPNRIETGI
ADALPFGEPL PALVQWRAEW IRGLRYRWEA INNTWNQNVL GYDPQRQREL LSRLGLADTD
WRSLVTLLGI ICSLLVAAIT AWTIYQRPPQ DPALRLWHKA LRQLARRQVD CAPWETPLAL
ARRVSEQHPE LADAFQRVAE AYLQARYGRS DNNLKTLREA IAQLR