Gene Daro_0009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_0009 
Symbol 
ID3570033 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp12950 
End bp13990 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content60% 
IMG OID637678438 
ProductApbE-like lipoprotein 
Protein accessionYP_283238 
Protein GI71905651 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value0.109653 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0510654 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGCTG TCCTGAATCT TTTCCTGTCT GGCGTTTGCG TACTGCTGCT TGCCAGCTGT 
GGTCGGACGC AGCTGCAAGA ACAGCAGGCC TACGTCTTTG GTACCCGTGT TGAGGTGCTG
GTCGTCAGCG AGGATCCGGA ACAGGGCCGC AAAGCGATTG CCGCCGTCTT GCGCGAATTC
GACCGCCTGC ACCGCGCCTA CCACGCTTGG CAGGACTCGG AACTGATGGC CTTGAACACG
GCATTTGCCC AGGGAAAAAC CCGACAGGTC AGCCCTGAAC TCGCCGCCTT CGTTCAGGAA
GCACAGGCCC TTTCCCAACA GGGCGACACC CTGTTCGATC CCGGCATTGG TCAGTTGATC
AAACTGTGGG GCTTCCAGGC CGACGAATTC AAGGCAGAAC TGCCTGCTGC AGCCGATATC
AAGGCCTGGT TGGCCAGCAA GCCATCCATT GCCGACGTCG TGATCGACGG CACCAATATC
CGCAGCCGCA ATCGTAACGT TGCCCTCGAT TTCGGCGGCT ACCTGAAGGG TGTCGCCCTT
GATCGCGCCT CGGCCATCCT CCACGCTCAA GGTATCCACA ACGCCCTGAT CAACATCGGC
GGCAATGTCA TGGCGCTGGG CAGCAAGGAA GGCAAGAAGT GGCGCGTCGG CATCCAGCAT
CCACGTCAGC CGGGTCCGAT GGCCACGGTC ACGCTCGATG ACGGCGAAGC GATCGGCACC
TCCGGCGACT ATCAACGCTT CTTCGAGGTC GACGGACGAC GTTACGCCCA CCTGCTCGAT
CCTCGCACCG GCTACCCGGT GGAACACACG CAGGCTGTCA CGGTGCTCAT CCCCAAGGGG
CCAAAAGCAG GCACCTTGTC CGATGCGGCC TCCAAGCCGA TTTTCATTGC AGGACCGGAT
GGCTGGCGCG ATATGGCGCG AAAAATGGGA ACCAGTCTCG TTTTGCGCGT CGACCATAGC
AATCAGATTT TCGTCACCGA GGCACTGCGC CAGCGTCTTG AATTCATCGG CGCCCCCCCG
AAACTCAACG TTGTCCAATA A
 
Protein sequence
MRAVLNLFLS GVCVLLLASC GRTQLQEQQA YVFGTRVEVL VVSEDPEQGR KAIAAVLREF 
DRLHRAYHAW QDSELMALNT AFAQGKTRQV SPELAAFVQE AQALSQQGDT LFDPGIGQLI
KLWGFQADEF KAELPAAADI KAWLASKPSI ADVVIDGTNI RSRNRNVALD FGGYLKGVAL
DRASAILHAQ GIHNALINIG GNVMALGSKE GKKWRVGIQH PRQPGPMATV TLDDGEAIGT
SGDYQRFFEV DGRRYAHLLD PRTGYPVEHT QAVTVLIPKG PKAGTLSDAA SKPIFIAGPD
GWRDMARKMG TSLVLRVDHS NQIFVTEALR QRLEFIGAPP KLNVVQ