Gene Daro_1707 details

Gene Information

Locus tag: Daro_1707
Symbol:
ID: 3568561
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 1833571
End bp: 1835670
Gene Length: 2100 bp
Protein Length: 699 aa
Translation table: 11
GC content: 44%
IMG OID: 637680174
Product: hypothetical protein
Protein accession: YP_284924
Protein GI: 71907337
COG category:
COG ID:
TIGRFAM ID:
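
The coordinate and length fields above agree with each other, which a few lines of arithmetic can show. This is a minimal sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 1833571
end_bp = 1835670

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 2100 699
```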


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value: 0.202061
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGCCTA ATGGCAAGCG AAATTATTTA ATTTTTTTCT TTGTAGCAGC TGCCTTATTT 
TGGGTGCTAG CCATTGCGGG TGCATTCCGA TCATATTCGC CTATTCCATT AGGGGACATG
TGGGATGGCT ATCTTGACTT TTACATTAAA GCGAGTGCGG GCGACTGGGG TGTCTGGTGG
AAAACTCACA ACGAGCACCG CATTGTCCTT TCACGCCTCT TTTTCTGGAT GGATCTCTCA
TGGTTTAACG GGTCCGTCTG GTTTCTATTG GCTTTAAATT ATGCTCTTCT TGTATTTTCG
TGTTTTATTT TCTGGATTGC GCTAAAAGAA AGATTGCCGG AACAGTATCA AATTCCTGGA
ATTTTTATCG CCATTTGGTT GAGCTCATGG TCACAGAACG AAAATCTCAC CTGGGGTTTT
CAGAGCCAAT TCATTCTAGC TCAATTGCTG CCCTTGGTAG CCTTCGGCAT GCTGCACAGA
GCTGCTTCTA ATAACTCTAT TTACAATAAT TACTTCATTG GATCATGTTT GATAGGGGTC
TTATCGCTTG GAACCATGGC TAATGGTGTA ATCGTTTTGC CGTTAATGGC CGCATATGCA
ATTATTACTC GCATCGGCTT ATGGAGAGTT CTTACCCTTG CGACTTTATC CTGCATCGGA
TTGCTGGCTT ACTTTTACGA TTACACGTCA CCCGCTTACC ATGGGTCTCT TTCCAGCTCT
TTAAAAGAAA ACCCATTCGG CCTTATTCTC TACGTAATAA CGTATATCGG CGGACCATTT
TACCATATTG CCGGGGGAAA GTTAGGGGGA GTTATTCTGG CCCAAGTTGC AGGCGGCGCC
CTAATTTTAA TTTCAACTTA TATCTCTTGG ACGTCATTAT TAAAATCAAA GAGGGACACG
TTAGAATTAG CTTTGCTGTT TTTTATATTA TACATTGGTG GAACTGCACT CGGGACCGGC
GGAGGAAGGC TTATTTTTGG TGTTGACCAA GCACTTTCAT CACGCTACTC GACGCCATCA
CTAATGGCTT GGGCTGCATT ATTGGTAATA ATCGCGCCAA AGCTCGCCAC ATCATCTGAG
AAAATAAAGT GGCAATTATG GCTGCCTTTA TCAGCCCTTA TCCTTGCAAT GCTTCCGTTG
CAATTCAAGG CTTGGCGTTC AGGCGATCAG AGTGTGTTTG AACGAGGTAT TGCCGGGCTT
GCTGTTGCAA TTGGCGTCAA CGATCAACTA CAAATTAGCC GTATCTATCC TTCTGCTGAG
CGAGCGATAT CGATTGGCAA GGCCGCCTCT GAACAAGAGT TGTCGTATTT CTCTCGAACA
GAATACAAAA ACATTAAAAA TGCCATTGGA AAACAAATTA ATGGGCCTTC CGAAATATCA
AAGGTTTGTC AAGGGCACAT TGACTTTATA GAACCGATTG AGAACGAAAA TAATTACATG
CGAGCGGGTG GTTGGATATT TAATCAATCA GAAAATGTCT CTCCGAGGGC TATTTGGTTA
ATCGATCAAA AGGGAGTCGT TGTCGGCTAT GGGTTAGTAG GCCAGCCTAG ACCTGATGTC
GCAGAGGCTG TTGACAAAAA AGCTTCCAAA TCCGGATTTA AAGGCTATTT TTTGAGTGAG
GCGCAAGGTG CTTCGGTAAT AGTATTTGAC CCAATTAGCA GGTGTGGTTT TTCTTCGGTA
CTGCCTGAGA TATTTTTTAC CTTAACAACT AACGGGAATA AAACCCAAGT AACTGTTGAT
GCGAACCAAG TTCTGCAAAA CAACCAGTGG ATTGGAACTG ACTACCAAAA ATCCAGAATT
GACGGGTTGG TCGTATTTGG CTCCTTCATT CAAGCCGATA GAGACAAAGG CGAAATATCG
CTACAACTTA AGCGTGGAGA TCGAATCCTC TATCGATCAG GGCCAACTTT TGGCAAGCAA
TACTTATCGT TGAGTTTTTC AAATACAGAA ATCGCGCTCC CTGTTTCACT AGAGTGGCGA
GTACTTGATT TTTCGAGTAA GGCTCTGCCA GATAGTTTTA TCGCGACTTT CCGAGATAAT
GGCGAAAACC GGGGCGAATG GTCTGCCATA GCTGTTTTAG CGTCTGAGGT TGTAAAATGA
 
Protein sequence
MLPNGKRNYL IFFFVAAALF WVLAIAGAFR SYSPIPLGDM WDGYLDFYIK ASAGDWGVWW 
KTHNEHRIVL SRLFFWMDLS WFNGSVWFLL ALNYALLVFS CFIFWIALKE RLPEQYQIPG
IFIAIWLSSW SQNENLTWGF QSQFILAQLL PLVAFGMLHR AASNNSIYNN YFIGSCLIGV
LSLGTMANGV IVLPLMAAYA IITRIGLWRV LTLATLSCIG LLAYFYDYTS PAYHGSLSSS
LKENPFGLIL YVITYIGGPF YHIAGGKLGG VILAQVAGGA LILISTYISW TSLLKSKRDT
LELALLFFIL YIGGTALGTG GGRLIFGVDQ ALSSRYSTPS LMAWAALLVI IAPKLATSSE
KIKWQLWLPL SALILAMLPL QFKAWRSGDQ SVFERGIAGL AVAIGVNDQL QISRIYPSAE
RAISIGKAAS EQELSYFSRT EYKNIKNAIG KQINGPSEIS KVCQGHIDFI EPIENENNYM
RAGGWIFNQS ENVSPRAIWL IDQKGVVVGY GLVGQPRPDV AEAVDKKASK SGFKGYFLSE
AQGASVIVFD PISRCGFSSV LPEIFFTLTT NGNKTQVTVD ANQVLQNNQW IGTDYQKSRI
DGLVVFGSFI QADRDKGEIS LQLKRGDRIL YRSGPTFGKQ YLSLSFSNTE IALPVSLEWR
VLDFSSKALP DSFIATFRDN GENRGEWSAI AVLASEVVK
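
The protein sequence can be derived from the gene sequence codon by codon. The following is a minimal sketch of that step, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as starts). The function name `translate` and the sample strings are illustrative and taken from the first and last codons of the gene sequence above.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; codon assignments are
# identical for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, spaced as on the page.
print(translate("ATGTTGCCTA ATGGCAAGCG AAATTATTTA"))  # prints: MLPNGKRNYL
```

The sketch does not handle alternative start codons, which table 11 would read as methionine at the first position. This gene starts with ATG, so that case does not arise here.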