Gene Daro_3000 details


Gene Information

Locus tag: Daro_3000
Symbol:
ID: 3567313
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 3238074
End bp: 3239696
Gene Length: 1623 bp
Protein Length: 540 aa
Translation table: 11
GC content: 64%
IMG OID: 637681471
Product: hypothetical protein
Protein accession: YP_286200
Protein GI: 71908613
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
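
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the gene ends in exactly one stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 3238074
END_BP = 3239696
GENE_LENGTH_BP = 1623
PROTEIN_LENGTH_AA = 540


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1623, matches Gene Length
print(coding_codons(GENE_LENGTH_BP))   # 540, matches Protein Length
```

Under these assumptions the listed start, end, gene length, and protein length are mutually consistent.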


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value: 0.0521457
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGAAT TAAGCACCCT GCTCCGTCGC GGTATCCGCT TGCCTCCGGC CGGCTGGATG 
CTTGCCGCCA TGCTCGCCTT CTACGTGCTG GCCGGCCTGT TCGGCCGCGA TCCGTGGAAG
GGCGAAGACG CCATTCACAT CGGTGCAGCA TGGCACATGC TGCACTTCAG CGACTGGCTG
TCTCCCGACC TGGCCGGCCG GCCATTCCAC GAACCGCCGT TGTACTACTG GAGTGCCGCG
CTGACCGGCA AGGCATTCGG CTGGCTGCTC CCTCTCCACG AGGCCATGCG TCTCGCCAGC
GGCATCTGGG TCACCTTGGC CTTGATGGGC CTGTACTACG CCAGCCGCGA GCTGTACGGC
GAGGACTCTG CAGCCGCGAG CCCGATGTTG CTGGCAGGTT GCGCCGGCCT GCTCTTCCAT
GCCCACGATG CGCAGCCGAT GCTGATTGCG CTTGCCGCCT ATAGTGGCGC TCTGGGTGGC
TTGGCCGCGA TTGGGCGCAA GCCACGACTG ACCGGCATCT ACTACGGACT GGCCGTTGCC
GGTTGCCTGC TCGGCACCGG CATTGCCCCA ACCCTGCCGT TACTTGCCAT CGCCCCGGTC
GCCTGGTGGT TATCCCCTGA CCGCCCCAAG GCCTTGCACA CACTGCTCAT CGGTTTTGGC
ATCGCGACGG TGCTGATTCT GCCATGGCCA CTGTTACTGC TGACCCTTGA ACCAGCACGT
TTTCATGGCT GGCTAGCCAC TGAACTGGCG CCGCTGAAGA CCCCGTTCTC CTTCGGCGGC
GGCAGCCGTT TCCTGGCCAT GTTGCCCTGG TTTGCCTTCC CCGCCATGCC GCTGGCGGCC
TGGACCTTAT GGACCAGACG CAAGGAACTG CAGACCCCGT CGTTCTTGTT GCCGCTCTCC
TTCCTGCTGA TCACCCTGCT GATGCTGGCC TGGGCGTTCC GCCCCCGGGA AATCCCATCC
CTGTTGCTCT TGCCGTCACT AGCCCTGCTC GCCACGCCTG GCACGCTGGC CCTGCGTCGC
GGTGCAGCCA ATGCCTTCGA CTGGTTTGCA ATGTCGACCT TCAGCCTGTT CGTCGCCGTC
GTCTGGCTGG CCTGGTCGGC AATGGCCCTT GGCTGGCCAT CAAAACTCGC CGAGCGGGCA
CTGATCCTGC GCCCCGGTTT TGTCGGCCAC TTCAGTCTGG TCGCACTGCT CATCGGCCTT
GCAGCCACGG CGTGGTGGAT CTGGCTGATC ATCACTGCCC CACGCTCACC GTATCGCAGC
CTGACGCACT GGACACTCGG CTTCACAACC TTGTGGCTAC TGGCAACAAC GCTGATCCTG
CCATGGTTCG ACTACGGCAA GACCTACCGG CCGGTCGCCC AGGCGATCGC CCAGGCCCTG
CCAGCCGATC ACGGCTGCCT GGCCGAACGC GGCTTGAGCG AAACCCAGCT GGCGTCGATG
TCCTATTTCG TCGAAATCGA GCCGGTTGCC GAAGACTCCA AGGCGGGCCA AGCTTGCAAC
TGGCTGCTGG TAGTCGGCGA CACGCGCCGT GAGTTGGCAG CACCGGGCAA GCAGTGGTCC
AAGGTCTGGG AAGGCAGCCG CCCCGGGGAC CGCAAGGAAA AATTCCGTCT TTTCCGGCGC
TAA
 
Protein sequence
MSELSTLLRR GIRLPPAGWM LAAMLAFYVL AGLFGRDPWK GEDAIHIGAA WHMLHFSDWL 
SPDLAGRPFH EPPLYYWSAA LTGKAFGWLL PLHEAMRLAS GIWVTLALMG LYYASRELYG
EDSAAASPML LAGCAGLLFH AHDAQPMLIA LAAYSGALGG LAAIGRKPRL TGIYYGLAVA
GCLLGTGIAP TLPLLAIAPV AWWLSPDRPK ALHTLLIGFG IATVLILPWP LLLLTLEPAR
FHGWLATELA PLKTPFSFGG GSRFLAMLPW FAFPAMPLAA WTLWTRRKEL QTPSFLLPLS
FLLITLLMLA WAFRPREIPS LLLLPSLALL ATPGTLALRR GAANAFDWFA MSTFSLFVAV
VWLAWSAMAL GWPSKLAERA LILRPGFVGH FSLVALLIGL AATAWWIWLI ITAPRSPYRS
LTHWTLGFTT LWLLATTLIL PWFDYGKTYR PVAQAIAQAL PADHGCLAER GLSETQLASM
SYFVEIEPVA EDSKAGQACN WLLVVGDTRR ELAAPGKQWS KVWEGSRPGD RKEKFRLFRR
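
As a quick consistency check between the two sequences above, the following sketch translates the first 60 bases of the gene sequence and compares the result with the first 20 residues of the protein sequence. It uses only the Python standard library. The codon table is the standard one; translation table 11 uses the same amino-acid assignments and differs only in which codons may act as starts, so it is sufficient here. The `translate` helper is a hypothetical name, not part of any library.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G).
# Table 11 assigns the same amino acids to every codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    cleaned = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and first 20 residues of the protein.
GENE_PREFIX = "ATGTCTGAAT TAAGCACCCT GCTCCGTCGC GGTATCCGCT TGCCTCCGGC CGGCTGGATG"
PROTEIN_PREFIX = "MSELSTLLRR GIRLPPAGWM"

print(translate(GENE_PREFIX) == "".join(PROTEIN_PREFIX.split()))  # True
```

Passing the full gene sequence to the same helper would allow the whole 540-residue protein to be compared in the same way.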