Gene Daro_1098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_1098 
Symbol 
ID3569371 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp1200966 
End bp1201883 
Gene Length918 bp 
Protein Length305 aa 
Translation table11 
GC content69% 
IMG OID637679560 
Producthypothetical protein 
Protein accessionYP_284324 
Protein GI71906737 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value0.100036 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.664944 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTCCT ACCGCCGCTG GCTTCCAGCC ATCGCCCTCG CCGTCCTGTC CCTGACCTGG 
GGCTACACCT GGGTGCTCGC CAAACAGGGC CTGGCCTACG CACCGCCCTT CGCCTTTGCC
GCTGAACGCT GCGTCGGTGG CGCGCTGTCG CTGCTCGTCG CCCTCAAGCT GACCGGTCGC
CGGCTGACCC TGGTCGCCCC CTTCCAGACC CTCGGCATCG GCCTGACCCA GGTCGCCGGC
TTCATGATCT TCCAGACCTG GGCGCTGGTC GAAGGCGGCC CGGGGAAGAC CGCCGTGCTC
ATCTTCACCA TGCCGATCTG GACCCTGCTC CTCGCCTGGC CGCTGCTCGG CGAGCGGGTG
CGTGGCAAGC AGTGGCTGGC GGCGGCCAGC ACGCTGACCG GCCTGCTGCT GATCATCGAA
CCGTGGGACA TGCACGCCAG CCTGTTCAGC AAATTCCTCG GCCTGATGGC CGCCCTGTGC
TGGGCCAGCG GCACCATCCT GATCAAGCGC CTGCGCGCCG TGACGCCGGT GGACCTGCTG
ACCCTGACCG CCTGGCAGAT GATCCTCGGC GCCGTGCCGC TGGTCCTGCT CGCCCTCGTC
GTGCCCGAAC CGGCCACCCA CTGGACGCCC GCCTACGTCG GCCTCCTGCT CTTCATGTCG
GTGGCCAGCA CGGCGATGTG CTGGTGGCTG TGGATCTATA TCCTCGACCG CGTGCCAGCC
TGGGAAGCCA GCCTGTCGGT GCTCGGCACG CCGGTCGTCG CCATCCTGTC GTCGCGCCTC
ACGTTCGGCG AATCGTTCAA GGGCACCGAG ATCGCCGGCA TCCTGCTCAT CGGCGGCGGC
CTCGCCCTGC TCTCGCTGCT TGGCTGGGCG GCCAGCCGGC GCAATCCGGC GCTCACCCAC
CCCAAGGAAC GCACATGA
 
Protein sequence
MNSYRRWLPA IALAVLSLTW GYTWVLAKQG LAYAPPFAFA AERCVGGALS LLVALKLTGR 
RLTLVAPFQT LGIGLTQVAG FMIFQTWALV EGGPGKTAVL IFTMPIWTLL LAWPLLGERV
RGKQWLAAAS TLTGLLLIIE PWDMHASLFS KFLGLMAALC WASGTILIKR LRAVTPVDLL
TLTAWQMILG AVPLVLLALV VPEPATHWTP AYVGLLLFMS VASTAMCWWL WIYILDRVPA
WEASLSVLGT PVVAILSSRL TFGESFKGTE IAGILLIGGG LALLSLLGWA ASRRNPALTH
PKERT