Gene Daro_3008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3008 
Symbol 
ID3567321 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3250051 
End bp3251016 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content63% 
IMG OID637681479 
Producttransglutaminase-like 
Protein accessionYP_286208 
Protein GI71908621 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones61 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCCTG TCCGTTATCG CGTCCTGCAC GAAACCCGCT ACGACTACGG CAGCCTCGTA 
TCGCTGTCGC AGCAGCAATT GCATCTTTCG CCGCGTGTTC TCGAGTGGCA GCAGGTGGAG
GAGCAGCGCA TCGACATCGA ACCGGTGCCG ACCTGGCGAC GTGACGGACG CGATGCCTTC
GGCAACCCGG TGACCTGGGT GGCCTTTCAT GCCCCGCATG AAATGTTGTT CATTCGGTCA
ATGATGACCG TTGCCGTGAC GCCCCACCTG CCTAAGGTTC TGGAAGACTC CCTGCCCTGG
GAAGAGGTGC GCGACCGCCT GGCCTACGAC GCTACCGACC CGCTGCCGGA AGATCTTGAC
GCGACGCGTT TCCTGTTCGA AAGCCCGCAT GTCCGGATCA AACACGAGCT GGCTGCCTAC
GCCGCCGACT GCTTCCCGCC GGACCGCCCG ATACTGGTTG GTGCCAAGGC CCTGATGGCC
AAGATTTTCC GCGAATTCAC CTTCGACCCC GAGGCGACCA CGGTGTCGAC ACCGGTGCTC
GAGGTGCTCG AAAACAAGCG TGGCGTCTGC CAGGACTTTG CCCACCTGAT GATCGCCTGT
CTGCGCGCCA TGGGCCTGGC CGCCCGTTAC GTCAGCGGCT ACCTGCTGAC CCGACCTCCA
CCGGGCAAGC CACGCCTGAT TGGCGCCGAT GCTTCACATG CCTGGGTATC GGTCTATGCG
CCGGGCAGCC AGCAGGGGGG GGGCGACTGG GTCGATTTCG ATCCGACCAA TGACCTGTTG
CCGGATACCG AACACATCAC CCTGGCTTTC GGTCGCGACT TCTCCGATAT CTCACCGCTA
CGCGGCATCA TCCTGGGCGG TGGCGGCACT GAACCCGATG TTGCGGTGAC CGTCGTTCCG
CTCGATGAGG AAGAAATTCC CGAAGAGATG CTCGACGAGC CGGACTCTGA AGCGGACGAA
GCATGA
 
Protein sequence
MTPVRYRVLH ETRYDYGSLV SLSQQQLHLS PRVLEWQQVE EQRIDIEPVP TWRRDGRDAF 
GNPVTWVAFH APHEMLFIRS MMTVAVTPHL PKVLEDSLPW EEVRDRLAYD ATDPLPEDLD
ATRFLFESPH VRIKHELAAY AADCFPPDRP ILVGAKALMA KIFREFTFDP EATTVSTPVL
EVLENKRGVC QDFAHLMIAC LRAMGLAARY VSGYLLTRPP PGKPRLIGAD ASHAWVSVYA
PGSQQGGGDW VDFDPTNDLL PDTEHITLAF GRDFSDISPL RGIILGGGGT EPDVAVTVVP
LDEEEIPEEM LDEPDSEADE A