Gene Daro_3215 details


Gene Information

Locus tag: Daro_3215
Symbol:
ID: 3566600
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 3467101
End bp: 3468396
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 64%
IMG OID: 637681686
Product: CBS:protein of unknown function DUF21:transporter-associated region
Protein accession: YP_286415
Protein GI: 71908828
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
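
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function name cds_lengths is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(3467101, 3468396)
print(gene_len, protein_len)  # 1296 431
```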


Plasmid Coverage information

Num covering plasmid clones: 58
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.141334
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGAAATCT GGATTCTGGT GGCGCTGATC TTGGGCTGTG GCGTTCTGGC CATGGCCGAG 
ATGGCCGTCG GGGCAAGCCG CATCTCCCAT CTCGCCATGC GGGCCGAGCA GGGTTCGGCG
GCGGCATCTG CGGTGCTGGG GTTTCGCCAG CACGCCAGCC GCCTGCTGGC CACGACCCAG
TTGGGCATCA CTGCGTTGGC CATGCTGTCC GGGGTTTATG GCGAAGCGCT ATGGGTGCCG
CGCCTGCAGG CGTGGCTTTC CGGCCTGCTT CCCCTGGCCG ACGGGCCGGC CTACGGCATC
GCGCTGGGCA TCGTCGTCAC CGTCATCACC TTCTTCTCGA TTGTTTTCGG CGAGGTCATC
CCGAAGCGTC TCGCGCTGTC CCATCCGGAA GCGCTGGCCG AAGCGCTGGC CGGGCTGATC
TCCGTGCTAC TCCGGCTGGC CCACCCCTTG GTGCTGGTCG TCTCGCGGAC GGCGGACTGG
ATACTGGCCC TCTTTCCGAG CAAGGGCTCG GCCGAGGCGA CGGCCGCCGA TGAAATCCGC
TTCCTGATCG AAGCTGGCCG CAAGGACGGA AACCTCGACC AGACGGAGAG CGAAATCCTG
GGCAACGTCT TTCGACTCGA CAACCGGCGG GTCGCCGGGA TCATGACACC GGCGGCCGCC
ATCGCCTGCC TGGACCTCAG CATCAGCCGC GAAGAGAACC TGAAGACCTT GCAGGAACGG
GCAGTTTCCC GTTTTCTGGT GTGCAAGGGC GGGATCGCCA ATGCCCTGGG GTTTGTCGAA
AGTCGCGAGC TGCTGCAGGT CCTGCTTAAC GGCAAGGACC TGGATTTCGG CAAGCTGTCG
CCTAATCCGC CGCACTACGT GCCGGGCACG CTATCGCTGA TCGGTCTGCT CGAATTCTTC
AAGGCCAACC AGACGCAAAC GGCACTGGTC GCCAACGAAT TCGGGGCGAC CGAAGGGCTG
GTCACGCTTT CCGACCTGAT GGGCACCGTG GTCGGCGATG TGCTGTCGGG TGCGGTCGAG
TCGCCGCTGG CCATCCAGCG TGGCGACGGC AGCTGGCTGC TCGACGGGCT GCTGGCCATC
GACGAGATGA AGGAGCTGCT CGGCATCAAG GAATTGCCCG AGGAGGATCT GGGCAACTTC
CATACAGTCG GCGGCTTCGT GATCGTTCAT CTCGGCCGGA TTCCGAAAAA GACGGAAGCT
TTCGACTGGG GTGACTGGCA CTTCGAGATC ATGGACATGG AAAAGAACCG GGTCGACGAA
GTACTGGCCA CGCGCGTCAC CGCATTGCCC AGCTAA
 
Protein sequence
MEIWILVALI LGCGVLAMAE MAVGASRISH LAMRAEQGSA AASAVLGFRQ HASRLLATTQ 
LGITALAMLS GVYGEALWVP RLQAWLSGLL PLADGPAYGI ALGIVVTVIT FFSIVFGEVI
PKRLALSHPE ALAEALAGLI SVLLRLAHPL VLVVSRTADW ILALFPSKGS AEATAADEIR
FLIEAGRKDG NLDQTESEIL GNVFRLDNRR VAGIMTPAAA IACLDLSISR EENLKTLQER
AVSRFLVCKG GIANALGFVE SRELLQVLLN GKDLDFGKLS PNPPHYVPGT LSLIGLLEFF
KANQTQTALV ANEFGATEGL VTLSDLMGTV VGDVLSGAVE SPLAIQRGDG SWLLDGLLAI
DEMKELLGIK ELPEEDLGNF HTVGGFVIVH LGRIPKKTEA FDWGDWHFEI MDMEKNRVDE
VLATRVTALP S
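
The gene and protein sequences above are printed in blocks of ten characters, so they need the whitespace removed before use. The following is a minimal sketch, not the method used by the source database, of how the gene sequence could be cleaned, translated, and checked against the protein sequence. It assumes the sequence contains only A, C, G and T, and it uses the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons). The names clean, translate and gc_fraction are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; amino acid assignments are those of the
# standard code, which translation table 11 also uses.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence shown above.
first_line = "ATGGAAATCT GGATTCTGGT GGCGCTGATC TTGGGCTGTG GCGTTCTGGC CATGGCCGAG"
last_line = "GTACTGGCCA CGCGCGTCAC CGCATTGCCC AGCTAA"
print(translate(first_line))  # MEIWILVALILGCGVLAMAE
print(translate(last_line))   # VLATRVTALPS
```

Passing the full gene sequence to translate and comparing the result with the cleaned protein sequence gives a quick consistency check, and gc_fraction on the same input can be compared with the reported GC content.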