Gene Daro_0729 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_0729 
Symbol 
ID3569120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp799360 
End bp802197 
Gene Length2838 bp 
Protein Length945 aa 
Translation table11 
GC content58% 
IMG OID637679178 
ProductPAS 
Protein accessionYP_283955 
Protein GI71906368 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.0635035 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAACCA ATTTGCCGGT AACCACCGTT GAACAGCATT TGCGGGATGA CACCCTGATC 
GTATCGAAAA CCGATCTCAA GGGACGAATC ACGTATATCA ACCGCGATTT CCTCGATATC
AGTGGTTTTA CTGAACAGGA ACTGATTGGG GAACCCCATA ATCTAGTGCG TCATCCGGAG
ATGCCGCCAG AAGCCTTTGA GGATCTCTGG CGCGACCTGA AGGATGGGCG TCCCTGGACC
GGCATGGTCA AAAACCGGTG CAAGAACGGC GATTACTACT GGGTTCTCGC CACGGTGACA
CCTATCCGCG AGGGCGGTGA AATTCTTGGC TACATGTCGG TGCGCCGCAA GGCCTCGGCC
CAGCAGATTC AGGCCGCCGA AGAGGCCTAT CGGCTATTCC GTGAAAAACG TCAAGGCAGC
CTGCAGATTC GGCATGGAGC GGCCGTCAAG GGCGGACCGG GGCTGTTGTC GGCGCTTTCC
CTGAAAAGCC GCATGGCGGC TGGCTTTGCC GTCATCCTTC TGGTGGTTGC CGTGGTGGCA
GGGCTTGGTC TATGGGGCAT GGGGCGGAGC GACGATGCTG TTGCCAGGCT TTACAGCAGC
CGCCTTGAGC CAGTCCAGGA GTTGGCCGCA ATCGGCAAGC TGATGGCTGA TAATCGGTCG
CAAGTTCTTC TGGCTTTCCA GCACGACCCG GCCAGCCCCA ACGCCAAATC TCACGATCAC
AGCGTAGACA AGCATCTAGG TGTGATCGAT AAGAATATCG GCATCATCAC CGGGCATTGG
GAGCGTTACT CAAAAGCGAT TGCGTCTGAC GAGCACCGCC AGTTGGCAGA CGCCTATGTG
GCCGCCCGGA AGGTGTACGT CACCGAAGGC CTGCTGGCAG CGAAGGCTGC AATCGCGACT
GGTCGCTTCG ACGAAGCCAA CGACATTCTC TTAAAGAAGC TTAATCCGGC CTATGAAGAG
GCTTCAAAGC GTGCTGATGA TCTTTACCAA TTACAGATCA GTCGGGGCAA GACGCAACTG
GAGGAGACTG ACAAGGCTTA CCAGCAGTTC CGCATCATCG TGATTGCCAT TGTCCTGGCC
GCACTTGCTT TCGGCGCTCT TGTTGCTTGG AGCATCATGC GTTCTGTCAT GCGGCCGCTG
GATGACATCA TCGCGACCTT CCAGTCATTG GCTCGCGGCG ACTATACCCG TAACGTCGAT
ATCGCCCGGA ATGACGAACT AGGCAAGGTG ATGCAGGGGC TGCAATCGAT GCAGATTCAG
CAAGGGTTCA ACGTTGCCGA GGCGACACGC GTTGGTGAGG AAAACCTGCG GATCAAGATC
GGCCTCGACA ACGTGGCGAC CAACGTGATG ATTGCCGATG ACGGCCTTAA CATCATTTAC
ATGAATCATG CCGTGACCCA GATGTTCGCT GCGGTGGAGA GCGACATTCG CAAGGATTTG
CCGCAGTTCT CGGCCGCGTC GCTGATGGGT AGCAATATTG ACATCTTCCA CAAGAGCCCG
GCGCATCAGC GCGGCATGCT GGAACGTCTG ACAGGGACGC ATCGCGCCAC GATCAGGCTG
GGCGGGCGTG TCTTCGCGCT GACGGTGACG CCGGTCATCA ATACACGCGG TGGGCGCTTG
GGATTCGCCG TCGAATGGCT GGATCGGACC AACGAGGTTG CGGTCGAAGA GGAAGTGAAC
CAGATCGTCA GCGCTGCGGC GAATGGCGAT TTTACCAAGC GTGTCTCGGA TGCCGGCAAG
ACGGGCTTCT TCCTGACCTT GGCTGGCGAC CTCAATCGCC TGTTGGAAAC CAGCCAGCGA
GGCTTGGAAG ATGTGGTGGT GGTCTTGTCT GCGATGGCGG ATGGCGATCT GACGAAGACC
ATTGAGGCTG AATATGCTGG TACCTTCGGG CAGTTGAAGG ATGATGCCAA TACGACGGTT
GCTCGGTTGC AGGAAATCGT CGGGCAGATC AAAGAGTCAA CCGACGCGAT CAACACGGCG
GCGAAGGAGA TTGCGTCGGG CAACCAGGAC CTGTCGAGCC GGACGGAAGA GCAGGCAAGC
AGCCTGGAAG AAACGGCCTC GAGCATGGAG CAGCTGACCA GCACGGTGAA GCAGAACGCC
GACAATGCAC GGCAAGCCAA CGAGCTGGCT GGTAATGCCC AGCGGGTGGC GGTCAAGGGG
GGCGAAGTGG TGGGTCAGGT TGTGGACACG ATGAGCGCCA TTCACCAATC GAGCAGCAAG
ATTGCCGACA TTATTGGTGT CATTGACGGC ATTGCGTTCC AGACCAACAT TCTGGCACTC
AACGCCGCGG TAGAAGCCGC TCGGGCCGGC GAGCAAGGCC GTGGGTTTGC GGTGGTCGCC
ACCGAAGTGC GTAACCTGGC CCAACGCAGT GCTGCGGCCG CCAAGGAAAT CAAGGGGCTG
ATCTCCGACT CGGTGGAAAA GGTTGAAACC GGCAATAAGT TGGTCGATCA GGCTGGACGG
ACCATGGAAG AAGTGGTCTC GAGCATCAAG CGCGTCGCCA AGATCATGGG CGACATCTCC
GATGCCAGCC GTGAGCAAAG CTCGGGCATC GAGCAAGTCA GCCTGGCCGT CAGCCAGATG
GACGAAGTGA CGCAGCAGAA CGCGGCGCTA GTTGAAGAAG CGGCGGCGGC GGCTGAAAGC
CTGGAAGAGC AGGCTCACAA TCTCGCTCAG GCCGTCTCTG TCTTCAAGGT GGCCAATGCG
GGCGGCATGC CTCGTCTGGA GGCGCCGCGC ACTAGCCAGC GTGCCTCTGT GCCGCAAGCC
CCTCGTGGCG AGCGAATTGG CGCCAGGAAA GTGCAGGCAT TGCCAAGCAG TCTCGATGAT
GAGTGGGAAG AGTTCTGA
 
Protein sequence
MRTNLPVTTV EQHLRDDTLI VSKTDLKGRI TYINRDFLDI SGFTEQELIG EPHNLVRHPE 
MPPEAFEDLW RDLKDGRPWT GMVKNRCKNG DYYWVLATVT PIREGGEILG YMSVRRKASA
QQIQAAEEAY RLFREKRQGS LQIRHGAAVK GGPGLLSALS LKSRMAAGFA VILLVVAVVA
GLGLWGMGRS DDAVARLYSS RLEPVQELAA IGKLMADNRS QVLLAFQHDP ASPNAKSHDH
SVDKHLGVID KNIGIITGHW ERYSKAIASD EHRQLADAYV AARKVYVTEG LLAAKAAIAT
GRFDEANDIL LKKLNPAYEE ASKRADDLYQ LQISRGKTQL EETDKAYQQF RIIVIAIVLA
ALAFGALVAW SIMRSVMRPL DDIIATFQSL ARGDYTRNVD IARNDELGKV MQGLQSMQIQ
QGFNVAEATR VGEENLRIKI GLDNVATNVM IADDGLNIIY MNHAVTQMFA AVESDIRKDL
PQFSAASLMG SNIDIFHKSP AHQRGMLERL TGTHRATIRL GGRVFALTVT PVINTRGGRL
GFAVEWLDRT NEVAVEEEVN QIVSAAANGD FTKRVSDAGK TGFFLTLAGD LNRLLETSQR
GLEDVVVVLS AMADGDLTKT IEAEYAGTFG QLKDDANTTV ARLQEIVGQI KESTDAINTA
AKEIASGNQD LSSRTEEQAS SLEETASSME QLTSTVKQNA DNARQANELA GNAQRVAVKG
GEVVGQVVDT MSAIHQSSSK IADIIGVIDG IAFQTNILAL NAAVEAARAG EQGRGFAVVA
TEVRNLAQRS AAAAKEIKGL ISDSVEKVET GNKLVDQAGR TMEEVVSSIK RVAKIMGDIS
DASREQSSGI EQVSLAVSQM DEVTQQNAAL VEEAAAAAES LEEQAHNLAQ AVSVFKVANA
GGMPRLEAPR TSQRASVPQA PRGERIGARK VQALPSSLDD EWEEF