Gene Daro_4068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_4068 
Symbol 
ID3566909 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4364033 
End bp4365658 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content66% 
IMG OID637682540 
Producthypothetical protein 
Protein accessionYP_287264 
Protein GI71909677 
COG category[R] General function prediction only 
COG ID[COG0433] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones53 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGAAC CTCTCTACCT CGCCAAGTCC GAAGACGGTT ATCCCGCATT GCTGCCGCAG 
ATGGCCAACC GCCACGGGCT GATCACCGGC GCCACCGGCA CCGGCAAGAC GGTCACCCTG
CAGTCGATGG CCGAACGCCT GTCCTACGCC GGCGTGCCGG TCTTCATGGC CGATGTGAAG
GGCGACCTCT CCGGCATGGG CGCCGCCGGC ACCCTGACCC CGAAGCTGGA AACCCGCCTC
AAGGACCTCG GCCTCGAAGG CTTCGCCCCC TACGCCAACC CGGTTGCCTT CTGGGATGTC
TTCGGCCAGG GCGGCGTGCC GGTGCGCGCC ACCATCTCCG ACATGGGCCC GCTGCTCCTC
GCCCGCCTGC TGAACCTGAA CGACACCCAG ACCGGCGTCC TGCAACTGGT CTTCAAGATC
GCCGACGACA AGGGCCTGCT GCTGATCGAC CTGAAGGACC TGCGCGCCTG CATCCAGTAC
GTCGGCGAAA ACGCCAAGGA CTTCACCACC GAATACGGCA ATGTCTCGAC CGCCTCGATC
GGCGCCATCC AGCGCGGCCT GCTGACGCTG GAAGAGCAGG GCGGCGACCG GTTCTTCGGC
GAACCGATGC TCAATATCAA CGACCTGATG AAGGTCGACG AAAACGGCCG CGGCGTCATC
AACGTCCTCG CCGCCGAAAA GCTGGTCCAA GCCCCGGCGC TCTACTCCAC CTTCCTGCTC
TGGCTGCTCT CCGAACTGTT CGAACAACTC CCGGAAGCCG GCGATCTGGA CAAGCCCAAG
CTCGTCTTCT TCTTCGACGA AGCCCATCTG CTGTTCACCG ACGCCCCGCA GGCGCTGACC
GACAAGGTCG AGCAGGTGGT CCGTCTGATC CGCTCCAAGG GCGTCGGCGT CTATTTCGTC
ACGCAGAACC CGCTCGACGT CCCTGAAAAG ATCCTCGGCC AGCTCGGCAA CCGCGTCCAG
CATGCCCTGC GCGCCTTCAC GCCGCGTGAC CAGAAGGCCG TCCAGGCAGC AGCGCAAACC
ATGCGCGCCA ACCCGAAATT CGATGCCGCC ACCGTGATCA CCGAACTCGG CGTCGGCGAA
GCGCTGGTTT CCTTCCTCGA CGAAAAGGGC AGGCCAACCA TGGTTGAGCG CAGCACCATC
TTCCCGCCCG CCTCCCGCCT CGGCCCACTG ACCGCCGACG AACGCCAGGC CATGATCAAC
GCCTCGCCGA TGCTCGCCAC CTACGGCCAG ACCGTCGACC GCGAATCCGC CTACGAAATC
CTGCGCGGCA AACCCGCCGC CACGCAAGCC GCCCCCGGCG CCATTCCGGC GCCACCGGCC
GGCAACAGCA GCCTCAACGA CAGCGACTGG GGCAACCATG CCAACCAGCA ACAGCCGCGC
TATGAACAAG CCCCGCAACC CCGACAGAGC GCCCCCGCCC CGCAGGAAAG CTCGGGCGGT
GGTCTGTTCG GCGGCCTCGG CGACATCCTG ACCGGCACCA CCGGCCCACG CGGTGGTCAT
CGCGAAGGCG TGCTCGAAAG TGCCGCCAAG AGCGCCGCTC GCGGCGTTGC CGGCACGGTT
GGCCGGGAGA TTGGGAAGCA GATTCTGCGC GGGGTGCTGG GGTCAATCCT GGGCGGACGA
CGCTAA
 
Protein sequence
MSEPLYLAKS EDGYPALLPQ MANRHGLITG ATGTGKTVTL QSMAERLSYA GVPVFMADVK 
GDLSGMGAAG TLTPKLETRL KDLGLEGFAP YANPVAFWDV FGQGGVPVRA TISDMGPLLL
ARLLNLNDTQ TGVLQLVFKI ADDKGLLLID LKDLRACIQY VGENAKDFTT EYGNVSTASI
GAIQRGLLTL EEQGGDRFFG EPMLNINDLM KVDENGRGVI NVLAAEKLVQ APALYSTFLL
WLLSELFEQL PEAGDLDKPK LVFFFDEAHL LFTDAPQALT DKVEQVVRLI RSKGVGVYFV
TQNPLDVPEK ILGQLGNRVQ HALRAFTPRD QKAVQAAAQT MRANPKFDAA TVITELGVGE
ALVSFLDEKG RPTMVERSTI FPPASRLGPL TADERQAMIN ASPMLATYGQ TVDRESAYEI
LRGKPAATQA APGAIPAPPA GNSSLNDSDW GNHANQQQPR YEQAPQPRQS APAPQESSGG
GLFGGLGDIL TGTTGPRGGH REGVLESAAK SAARGVAGTV GREIGKQILR GVLGSILGGR
R