Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Daro_0729 |
Symbol | |
ID | 3569120 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dechloromonas aromatica RCB |
Kingdom | Bacteria |
Replicon accession | NC_007298 |
Strand | + |
Start bp | 799360 |
End bp | 802197 |
Gene Length | 2838 bp |
Protein Length | 945 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637679178 |
Product | PAS |
Protein accession | YP_283955 |
Protein GI | 71906368 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.0635035 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAACCA ATTTGCCGGT AACCACCGTT GAACAGCATT TGCGGGATGA CACCCTGATC GTATCGAAAA CCGATCTCAA GGGACGAATC ACGTATATCA ACCGCGATTT CCTCGATATC AGTGGTTTTA CTGAACAGGA ACTGATTGGG GAACCCCATA ATCTAGTGCG TCATCCGGAG ATGCCGCCAG AAGCCTTTGA GGATCTCTGG CGCGACCTGA AGGATGGGCG TCCCTGGACC GGCATGGTCA AAAACCGGTG CAAGAACGGC GATTACTACT GGGTTCTCGC CACGGTGACA CCTATCCGCG AGGGCGGTGA AATTCTTGGC TACATGTCGG TGCGCCGCAA GGCCTCGGCC CAGCAGATTC AGGCCGCCGA AGAGGCCTAT CGGCTATTCC GTGAAAAACG TCAAGGCAGC CTGCAGATTC GGCATGGAGC GGCCGTCAAG GGCGGACCGG GGCTGTTGTC GGCGCTTTCC CTGAAAAGCC GCATGGCGGC TGGCTTTGCC GTCATCCTTC TGGTGGTTGC CGTGGTGGCA GGGCTTGGTC TATGGGGCAT GGGGCGGAGC GACGATGCTG TTGCCAGGCT TTACAGCAGC CGCCTTGAGC CAGTCCAGGA GTTGGCCGCA ATCGGCAAGC TGATGGCTGA TAATCGGTCG CAAGTTCTTC TGGCTTTCCA GCACGACCCG GCCAGCCCCA ACGCCAAATC TCACGATCAC AGCGTAGACA AGCATCTAGG TGTGATCGAT AAGAATATCG GCATCATCAC CGGGCATTGG GAGCGTTACT CAAAAGCGAT TGCGTCTGAC GAGCACCGCC AGTTGGCAGA CGCCTATGTG GCCGCCCGGA AGGTGTACGT CACCGAAGGC CTGCTGGCAG CGAAGGCTGC AATCGCGACT GGTCGCTTCG ACGAAGCCAA CGACATTCTC TTAAAGAAGC TTAATCCGGC CTATGAAGAG GCTTCAAAGC GTGCTGATGA TCTTTACCAA TTACAGATCA GTCGGGGCAA GACGCAACTG GAGGAGACTG ACAAGGCTTA CCAGCAGTTC CGCATCATCG TGATTGCCAT TGTCCTGGCC GCACTTGCTT TCGGCGCTCT TGTTGCTTGG AGCATCATGC GTTCTGTCAT GCGGCCGCTG GATGACATCA TCGCGACCTT CCAGTCATTG GCTCGCGGCG ACTATACCCG TAACGTCGAT ATCGCCCGGA ATGACGAACT AGGCAAGGTG ATGCAGGGGC TGCAATCGAT GCAGATTCAG CAAGGGTTCA ACGTTGCCGA GGCGACACGC GTTGGTGAGG AAAACCTGCG GATCAAGATC GGCCTCGACA ACGTGGCGAC CAACGTGATG ATTGCCGATG ACGGCCTTAA CATCATTTAC ATGAATCATG CCGTGACCCA GATGTTCGCT GCGGTGGAGA GCGACATTCG CAAGGATTTG CCGCAGTTCT CGGCCGCGTC GCTGATGGGT AGCAATATTG ACATCTTCCA CAAGAGCCCG GCGCATCAGC GCGGCATGCT GGAACGTCTG ACAGGGACGC ATCGCGCCAC GATCAGGCTG GGCGGGCGTG TCTTCGCGCT GACGGTGACG CCGGTCATCA ATACACGCGG TGGGCGCTTG GGATTCGCCG TCGAATGGCT GGATCGGACC AACGAGGTTG CGGTCGAAGA GGAAGTGAAC CAGATCGTCA GCGCTGCGGC GAATGGCGAT TTTACCAAGC GTGTCTCGGA TGCCGGCAAG ACGGGCTTCT TCCTGACCTT GGCTGGCGAC CTCAATCGCC TGTTGGAAAC CAGCCAGCGA GGCTTGGAAG ATGTGGTGGT GGTCTTGTCT GCGATGGCGG ATGGCGATCT GACGAAGACC ATTGAGGCTG AATATGCTGG TACCTTCGGG CAGTTGAAGG ATGATGCCAA TACGACGGTT GCTCGGTTGC AGGAAATCGT CGGGCAGATC AAAGAGTCAA CCGACGCGAT CAACACGGCG GCGAAGGAGA TTGCGTCGGG CAACCAGGAC CTGTCGAGCC GGACGGAAGA GCAGGCAAGC AGCCTGGAAG AAACGGCCTC GAGCATGGAG CAGCTGACCA GCACGGTGAA GCAGAACGCC GACAATGCAC GGCAAGCCAA CGAGCTGGCT GGTAATGCCC AGCGGGTGGC GGTCAAGGGG GGCGAAGTGG TGGGTCAGGT TGTGGACACG ATGAGCGCCA TTCACCAATC GAGCAGCAAG ATTGCCGACA TTATTGGTGT CATTGACGGC ATTGCGTTCC AGACCAACAT TCTGGCACTC AACGCCGCGG TAGAAGCCGC TCGGGCCGGC GAGCAAGGCC GTGGGTTTGC GGTGGTCGCC ACCGAAGTGC GTAACCTGGC CCAACGCAGT GCTGCGGCCG CCAAGGAAAT CAAGGGGCTG ATCTCCGACT CGGTGGAAAA GGTTGAAACC GGCAATAAGT TGGTCGATCA GGCTGGACGG ACCATGGAAG AAGTGGTCTC GAGCATCAAG CGCGTCGCCA AGATCATGGG CGACATCTCC GATGCCAGCC GTGAGCAAAG CTCGGGCATC GAGCAAGTCA GCCTGGCCGT CAGCCAGATG GACGAAGTGA CGCAGCAGAA CGCGGCGCTA GTTGAAGAAG CGGCGGCGGC GGCTGAAAGC CTGGAAGAGC AGGCTCACAA TCTCGCTCAG GCCGTCTCTG TCTTCAAGGT GGCCAATGCG GGCGGCATGC CTCGTCTGGA GGCGCCGCGC ACTAGCCAGC GTGCCTCTGT GCCGCAAGCC CCTCGTGGCG AGCGAATTGG CGCCAGGAAA GTGCAGGCAT TGCCAAGCAG TCTCGATGAT GAGTGGGAAG AGTTCTGA
|
Protein sequence | MRTNLPVTTV EQHLRDDTLI VSKTDLKGRI TYINRDFLDI SGFTEQELIG EPHNLVRHPE MPPEAFEDLW RDLKDGRPWT GMVKNRCKNG DYYWVLATVT PIREGGEILG YMSVRRKASA QQIQAAEEAY RLFREKRQGS LQIRHGAAVK GGPGLLSALS LKSRMAAGFA VILLVVAVVA GLGLWGMGRS DDAVARLYSS RLEPVQELAA IGKLMADNRS QVLLAFQHDP ASPNAKSHDH SVDKHLGVID KNIGIITGHW ERYSKAIASD EHRQLADAYV AARKVYVTEG LLAAKAAIAT GRFDEANDIL LKKLNPAYEE ASKRADDLYQ LQISRGKTQL EETDKAYQQF RIIVIAIVLA ALAFGALVAW SIMRSVMRPL DDIIATFQSL ARGDYTRNVD IARNDELGKV MQGLQSMQIQ QGFNVAEATR VGEENLRIKI GLDNVATNVM IADDGLNIIY MNHAVTQMFA AVESDIRKDL PQFSAASLMG SNIDIFHKSP AHQRGMLERL TGTHRATIRL GGRVFALTVT PVINTRGGRL GFAVEWLDRT NEVAVEEEVN QIVSAAANGD FTKRVSDAGK TGFFLTLAGD LNRLLETSQR GLEDVVVVLS AMADGDLTKT IEAEYAGTFG QLKDDANTTV ARLQEIVGQI KESTDAINTA AKEIASGNQD LSSRTEEQAS SLEETASSME QLTSTVKQNA DNARQANELA GNAQRVAVKG GEVVGQVVDT MSAIHQSSSK IADIIGVIDG IAFQTNILAL NAAVEAARAG EQGRGFAVVA TEVRNLAQRS AAAAKEIKGL ISDSVEKVET GNKLVDQAGR TMEEVVSSIK RVAKIMGDIS DASREQSSGI EQVSLAVSQM DEVTQQNAAL VEEAAAAAES LEEQAHNLAQ AVSVFKVANA GGMPRLEAPR TSQRASVPQA PRGERIGARK VQALPSSLDD EWEEF
|
| |