Gene Daro_2102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_2102 
Symbol 
ID3566998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp2262241 
End bp2264598 
Gene Length2358 bp 
Protein Length785 aa 
Translation table11 
GC content65% 
IMG OID637680575 
Producthypothetical protein 
Protein accessionYP_285315 
Protein GI71907728 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones52 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGCC AACTCAGTCT CTTCGCTTTC ATCCTCGGCC TGCTTGCCGT TGTCTGGGTC 
GCCATCGGCT ACATCGGCAG CCACACGTTG GCTTTTTCCG TCTCCTGCAT CATCGCGATG
GTCTATATCG CCGGCGCCCT CGAAATGCGG AGCTTCCACG CCAATACCGC CGCGCTGGCC
AGGGCGCTCG GCAATATTCC GGACGACATC GGCCACCTCG GCGAATGGCT GCAACACGTG
CCAGTCGCGT TGCAAAACGC CGTCCGGCTG CGCATCGAAG GCGAACGCGT TGCCTTGCCG
GGGCCAAACA TCACGCCCTA TCTTGTCGGG TTGCTGGTCC TGCTCGGCAT GCTCGGCACC
TTCCTTGGCA TGGTCGTCAC GCTGAACGGC GCCGTGCTGG CGCTGGAAAG CACCACCGAC
CTGCAAACCA TCCGCGCCGC GCTGGCCGCG CCGGTCAAGG GCCTCGGTGT CGCTTTCGGC
ACCTCCATCG CCGGCGTTGC CACCTCGGCC ATGCTCGGCA TGATCTCCGC CCTCTGCCGG
CGTGAGCGCC TGCAGGTCGG ACAGTTGCTC GACAGCAAGA CAGCCAGCGT GTTGCGGGAT
TTTTCGCTCA ACTACCAGCG CCAGGAAACC TTCAAGGCCC TGCAGGTGCA GGCCCAGGTA
CTGCCCGACC TGGTCGGCAA ACTCGATACC TTGATGACCC AGATGGCCGA GCACAGTCGC
CAGCTCGGCG ACCAGCTGCA GGCCGGCCAG CAGCGATTCC ATGACGAAGC CCGGGGGCTT
TATGCCAACC TCGCCCAGTC CGTCGATCAA TCGCTCAAGG CCAGCCTGAG GGACAGTGCA
ACGGTCGCGG CAGAAACCAT ACAACCGGTG GTCACTGCCA CGATGACAGG CATTGCCAAC
GAAACGCGCG GCCTGCACGA CAAGCTGTTC GGCACCATGG AAACCCAGCT CTCGGGCATC
GCCGGCCAGT TCGCACAAAC CGTCAGCACG GTCAGCGACA CCTGGACACA CGCGCTGAAC
TGCCACGAAC AGGCCACCGA CGCGCAGCAC CAGCAACTGC AAGACGCGCT CACTACCTAC
GCCAACACGT TCGATCAACG TGCCGCTGCG CTGCTCGCCT CGGTCGACAG CACCCAGGCC
AGACTGCAGA CCGAAGCGGC GCAACAACAC AGCTTGCTGG CCGAGACAAC AGCCGCCAGT
CAGCAGCAAC TGGCCAGCAC ACTCGCCAGC CAGTTCGACG GCGTCACCAC CCGCCTTGAT
CAGGCCGTGA GCCAGGTTGC CGACACCTTC AGCGAGCGCA CCCGAAACCT GCTGGATGCC
GTAGACAGCA GCCACACCAC CTTCGCCCAG AGCATGCAGG CCCAGCAGGC CGCCCTGACC
CAGCAGGTCG CCAGCCAGCT CGATGGCATT GCCGAGCGTT TCGATGGCAC GGTGCAAACG
GTTTCCGACA CCTGGAACGG GGCGCTGGCC AAACACGAGC AAGCCAGTGA GCGCCTCACT
CAGGCACTGG ACCAGACCCA GCGCACGCTG GCCGAAACCT TCGTCGAGCG CACTGCTGCG
CTGCTCGCCG ACATGCGCGC CACTCAAGCT GCCTGGCAGA GCGAATGGTC GGCCGGCGAA
CAAGCACGCC AGGCCACCTT CACCGATACC CTGACCGCCA TGTCCGCCAA GATGGAAGCC
CAGTGGCAAC AGGCCGGCCA GTCCACGCTG GCTCAGCAGG AGCAAATCAC TCAAACGCTC
GGTGCCACGG CCCGCGACCT CGTTGCAACC CACCAGCGCC AGGCCGAAAC GACCATCGCC
GAAGTCACCC GCCTGATGCA GACCGCGGCC GAAGCGCCCA AGGCGGCCGC CGAGGTCATC
GGCCAACTGC GCCATGAGCT CTCCGCCAGC ATGGCCCGCG ACAACAGCCT GCTTGAAGAA
CGCAGCCGCA TCATGGAAAC GCTGGGCGCC CTGCTCGATG CCATCAACCA CGCCTCGACC
GAGCAGCGCA GCGCCATCGA TGCGCTGGTC GCCTCGTCGG CCGAACTGCT CGAGCGCGTT
GGCAACCAGT TTGCCGCCAA GGTTGAAGGC GAAGCCGGCA AGCTGGTCGA CATCGGCGCC
AGCATCACCG GCGGCGCCCT CGAAGTCGCC AGCCTGGGCG AGGCTTTCGG CCATGCCGTA
AAGCTCTTCA GCGAATCCAA CGACAAGATG ATGATTGCGC TGCAGCGCAT CGAAGGCGCG
CTGGCCAAGT CGCTCACCCG CAGCGACGAG CAACTCGCCT ACTACGTTGC CCAGGCGCGC
GAAATCATCG ACCTCAGCAT CATGTCGCAG AAAAAAATGG TCGACGACCT GCAACGCGCA
GCGGGTAGCG AGGCGTGA
 
Protein sequence
MNRQLSLFAF ILGLLAVVWV AIGYIGSHTL AFSVSCIIAM VYIAGALEMR SFHANTAALA 
RALGNIPDDI GHLGEWLQHV PVALQNAVRL RIEGERVALP GPNITPYLVG LLVLLGMLGT
FLGMVVTLNG AVLALESTTD LQTIRAALAA PVKGLGVAFG TSIAGVATSA MLGMISALCR
RERLQVGQLL DSKTASVLRD FSLNYQRQET FKALQVQAQV LPDLVGKLDT LMTQMAEHSR
QLGDQLQAGQ QRFHDEARGL YANLAQSVDQ SLKASLRDSA TVAAETIQPV VTATMTGIAN
ETRGLHDKLF GTMETQLSGI AGQFAQTVST VSDTWTHALN CHEQATDAQH QQLQDALTTY
ANTFDQRAAA LLASVDSTQA RLQTEAAQQH SLLAETTAAS QQQLASTLAS QFDGVTTRLD
QAVSQVADTF SERTRNLLDA VDSSHTTFAQ SMQAQQAALT QQVASQLDGI AERFDGTVQT
VSDTWNGALA KHEQASERLT QALDQTQRTL AETFVERTAA LLADMRATQA AWQSEWSAGE
QARQATFTDT LTAMSAKMEA QWQQAGQSTL AQQEQITQTL GATARDLVAT HQRQAETTIA
EVTRLMQTAA EAPKAAAEVI GQLRHELSAS MARDNSLLEE RSRIMETLGA LLDAINHAST
EQRSAIDALV ASSAELLERV GNQFAAKVEG EAGKLVDIGA SITGGALEVA SLGEAFGHAV
KLFSESNDKM MIALQRIEGA LAKSLTRSDE QLAYYVAQAR EIIDLSIMSQ KKMVDDLQRA
AGSEA