Gene Daro_0158 details

Gene Information

Locus tag: Daro_0158
Symbol:
ID: 3569525
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 172892
End bp: 174088
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 65%
IMG OID: 637678593
Product: glycosyl transferase, group 1
Protein accession: YP_283387
Protein GI: 71905800
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
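
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon; neither convention is stated in the record, but both agree with the listed values. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 172892
END_BP = 174088
GENE_LENGTH_BP = 1197
PROTEIN_LENGTH_AA = 398


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 174088 - 172892 + 1 gives 1197 bp, and 1197 / 3 - 1 gives 398 aa, matching the record.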


Plasmid Coverage information

Num covering plasmid clones: 55
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTTCCC CCAAGCGCGT CCTGCTGCTC GATACCGGTA ACGAATGGGG CGGCGGCACC 
AACAGCATGT TTGAACTGCT GAAGCGCATC GACCACTCGC GCTTTGCCGT CACCTGCTGC
TTCTACAAGG ATTATCGCAA GGGCAGGCAC GGCCGGCTGC TCTCCGAGGA GTTGGCCGAT
ATCGGTATTC CGCTGATCGT GCTGCCCAGC CGCAAGCAGC CGCTGTGGGC CAAGCTGGCG
AAGGAGATCG CGCGCGGCCT GTTGTCGTGG TCCGGCCGGC TGAAGAAGCG GGCGGTGCTG
GCCATCGAAA TGCAGTGGCG CATCAAACCG CGCGTTGCCG CGCTGAAGCT GCTACTCCAA
GAAGGCGCTT ACGATCTGCT GTACATGAAC AACCAGCCGT CGTCGAACCT GGAGGGCTAT
CTCGCTGCCG AGGCAGCCAG CCTGCCGGTC GTCCAGCACT GCCGCATCGA GCCGACCCTG
CAGGCCAACG AGGCCGCCGT GGTCAACCGC ACGGCGACGC GCATCATCTG CGTCTCGCAG
GGGGTGGCCG ATGTGCTGGC GGCACAGCAT GTCGCTGAAA ACCGGCTGCG CGTGGTCTAT
AACGCCATCG ACAGCCGCAT TGCCTTGCCC GCCCCGGTCA GACTGCCGCC GACGACCAAG
GACCTGGTCA TTGGTACGGT CGGCCAGTTG ACGGCGCGCA AGGGTGTGCT GCACCTGCTG
CAGGCCGTGG CCAACCTGAA GGCAGAAGGC TTGCCGGTGA CTTGCCTGAT TCTGGGCGAG
GGGCCACAGC GGACTGAACT GGAAAGTGCC GTCGGTCGTC TGGGGCTGCT CGATCAGGTC
AGTTTTGTCG GCTTCCAGTC CGTGCCGCTG GCCTGGGTGC AGGTAATGGA CGTCTGTGTC
CTCTGTTCGA GCAAGGAAGG CCTGCCACGC GTCGTGCTCG AAGCCATGTT GGCGGGCAAG
CCGGTGGTCG GCTCCGACGT GACCGGCACC CGCGAACTGA TCGTTCACGA GGAAACCGGT
TTGCTCTACG CCTATGGCGA CGTGGCGGCG CTGACCGCAT CATTGCGCCG GCTGTTGTCC
GATGCCGAAC TGCGCCGGAG AATGGGCGCG GCGGGTTGCC AGCGGGTGGC CGAACGCTAT
TCGATCGAGG CCTACGTGGC CGGCGTGATG CAGGTTCTGG ACGAGGCGGC CCGGTGA
 
Protein sequence
MTSPKRVLLL DTGNEWGGGT NSMFELLKRI DHSRFAVTCC FYKDYRKGRH GRLLSEELAD 
IGIPLIVLPS RKQPLWAKLA KEIARGLLSW SGRLKKRAVL AIEMQWRIKP RVAALKLLLQ
EGAYDLLYMN NQPSSNLEGY LAAEAASLPV VQHCRIEPTL QANEAAVVNR TATRIICVSQ
GVADVLAAQH VAENRLRVVY NAIDSRIALP APVRLPPTTK DLVIGTVGQL TARKGVLHLL
QAVANLKAEG LPVTCLILGE GPQRTELESA VGRLGLLDQV SFVGFQSVPL AWVQVMDVCV
LCSSKEGLPR VVLEAMLAGK PVVGSDVTGT RELIVHEETG LLYAYGDVAA LTASLRRLLS
DAELRRRMGA AGCQRVAERY SIEAYVAGVM QVLDEAAR
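
The sequences above are printed in space-separated blocks of ten characters, so they need light cleanup before any analysis. The following is a minimal sketch of how a pasted gene sequence could be normalized and its GC fraction computed for comparison with the GC content field in the record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and how the database itself computes or rounds its GC value is not stated here.

```python
def clean_sequence(text: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, copied from the record above.
    sample = "ATGACTTCCC CCAAGCGCGT"
    print(len(clean_sequence(sample)))
    print(gc_fraction(sample))
```

Passing the full gene sequence to `gc_fraction` and multiplying by 100 gives a percentage that can be compared with the reported GC content.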