Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_06870 |
Symbol | retS |
ID | 7759640 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 650050 |
End bp | 652851 |
Gene Length | 2802 bp |
Protein Length | 933 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643803608 |
Product | hybrid histidine protein kinase RetS |
Protein accession | YP_002797912 |
Protein GI | 226942839 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.577631 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGCCGGC CTTGGAATGC CACAACGCTC ATCGTCAGTC TGCTGCTGGT CGTACTCGCG GCCTTGCCGA CCCTGGCAGG CAGCCTGCTG CGCTCGGACG AACAGTTCGG CCCGCTCGGC CCAGGCGCAC CGAACGGCGA CTGGCGAATC CTTCTCGACG AATCGGCCAG CCTCACCCTG AAGGATGTCA TCGAGCGGCG CGACCATTTC GCCCCGCTCG GCCACCGCTC CCTCACCCTG CCGGCCAACC AGGCCGCCTG GCTGCGCGTG TCCATCGCCG GGCACGACAC CCCGCGCTGG ATCTGGGTGT TCGCTCCGCG CGTGGACCGG GTCGACTTCT TCCTGACCAA CAGGGGCGCG ACGGAGCGCC GGATCGAGAC CGGAGCCATG CTCCCGGACG GCCTCTCCAC GTCCGGGCAG GCCCACCTGT TCGACCTGCC GACGGACCAG ACAACGCGCG AGGTCTGGCT GCGACTGGCT CCCCGGCAAG CCGCGCCGGC CTGGTTCGAC TACGTGGATA CCGCCGGACT GCTGGACAAG AACAAGCTGG CCTATACCCT CGGCGCCCTG CTCAGCGCCC TCGCCCTCGG GATGATCTAC CACCTGGTCC GTTTCGGCTA CAACAGGGTG CTCTGCAATC TCTGGCTGTC GGCCATGCAG GGCACGCTCC TGCTCAGCGC CATCGCGCAT TTCGGCCTGC TCGGCGCCTG GTTGCCGCAA CTCGGGCATT ACCAGATCCG GGTGGCGGAT ATCACCGCCC TCCTCAGCTT CCTCTTCCTG CTGCAGTTCA CTCGCAGCTT CTTCGACCAG GAGGAGTCCC CGCGCCTGCT CGACTGGCTG CTGCGCGGCG AGATCGCCGT GCTGGCACTG GCCGCCGTGC TCGACGGCGC CTCGCCCCAC GACATGCTGC GCAGCGGGCT GATCCTGCTC GCCAGCCTGA GCGCGACGGG CTGCGCCCTC TACCACTGGC TGCACGGCTA TCGCCCGGCA CGCCTGGCGC TGGTCGGGAC GCTGCTCTTC CACCTGGGTT TCAGCGCCTA TCTGTCGATG CTGCTGGATT TCGGCCAGTT CGGCCCCTGC TGGCAGATCT TCGCGCTCTT CGCCTTCACC GCCGCCAGCG GCCTGCTGCT CGTCTTCGCC CTGGCCGAGC GGCAGTGCAA GATCGAGCGG GACCGCGCCG CCTGGCGCAC CAGCGAAGCG GCGAGCAGCG CCGAATTGCG CGCCAAGAGC GAATTCCTGG CCAAGCTCAG CCATGAGATC CGCACCCCGA TGAACGGCGT CCTGGGCATG ACCGAACTGC TGCTCGGCAC GCCGCTGACC GCCAAGCAAT ACGACTACGT GCAGACGATC AACAGCTCGG GCAACGAGCT GCTCAGCCTG ATCAACGACA TTCTCGACAT CTCCCGCCTG GAATCCGGCC AGATCGAACT GGACGACGTG CAGTTCGACC TCAATGGCCT GATCGAGGAC TGCCTGGGCA TCTTCCGCGC CAAGGCGGAA AGGCGCCATG TCGAGCTGAT CAGCTTCATC CAGCCGCAGG TGCCGCGGGT GATCAACGGC GACCCGACGC GGCTGCGCCA GATCCTGCTG ACCCTGATCG AGAACGCCTT CCGGCAGACC GACGAAGGCG AAATCCTGCT CAAGGTGTCG CTGGAAACCG AGGGTCCCCG GCCGAACCTG CACGTCGCCG TGCAGGACAG CGGGGAACCG ATGCCGGAGT CCAGACGCGA AGCGATGCTG AACAGCCGCG TGCAGAGCAG CGACCTGCTC TCCGCCGCGC AGATCGACGG CAGCCTCGAC CTGATCATCG CCCGTCAACT GATCACCCTG ATGCACGGTA TCTGCGGCAT CGATGCCGAT CCGACCCGCG GCACCACGCT CTGGCTGTCG CTGCCCCTCG ACCCGGCCCG CCTGGAGCAT CCGGTCGCCG ACCTGGACAG CCCGCTGCAG GACGCCCGCC TGCTGGTGGT GGACGACAAC GACACCTGCC GCAAGGTCCT GCTGCAGCAG TGCGGGGCCT GGGGCATGAA GGTCAGTGCG GTCTCCTCCG GCAAGGAGGC CCTGGCCCTG CTGCGCACCA AGGCCGATCT GGGGGAATAC TTCGACGCGG CGCTGCTCGA CCAGAACATG CCCGGCATGA GCGGCCTGCA ACTGGCCCAG CGGATCAAGG AAGACCCCGG CCTGAATCAC GACATCCTGC TGGTCATGCT GACCGGCCTC AGCGATGCGC CGGGACGGGT CTACGCGCGC AACGCCGGCA TCACCTGCAG CCTGGCCAAG CCGGTGGCCG GCTACACGCT GAAGACCACC CTCGCCGAGG AACTCGGCAA GCGCCGCGCG GCCCGGCCGG CCGCCAGCGG CCTGGAGGAG AACGCCCCCC TGGAAGTGCC GAGCGACTTC CGCATCCTGG TCGCCGAGGA CAACAGCATC TCCACCAAGG TCATCCGCGG CATGCTGCGC AAGCTCAACC TGCAACCGGA TACCGCCAGC AACGGCGAGG AGGCCCTGCA GGCGATGAAG GCCAAGCAGT ACGACCTGGT GCTCATGGAC TGCGAGATGC CGGTGCTCGA CGGCTTCGCC GCCACCGAGC AGTTGCGCGC CTGGGAAGCC AGCGAACTCC GTCCGCACAC GCCGATCGTC GCGCTCACCG CGCATATCCT CAGCGAGCAC CGCGAGCATG CCCAGCGGGT CGGCATGGAC GGCCACCTCG CCAAGCCGGT GGAGTTGTCG CAACTGCGCG CCCTGATCGA ACACTGGGTC GAGCGCAAGG AGGAGTCCTA CCGCCACGCG CTTCATCCCT GA
|
Protein sequence | MRRPWNATTL IVSLLLVVLA ALPTLAGSLL RSDEQFGPLG PGAPNGDWRI LLDESASLTL KDVIERRDHF APLGHRSLTL PANQAAWLRV SIAGHDTPRW IWVFAPRVDR VDFFLTNRGA TERRIETGAM LPDGLSTSGQ AHLFDLPTDQ TTREVWLRLA PRQAAPAWFD YVDTAGLLDK NKLAYTLGAL LSALALGMIY HLVRFGYNRV LCNLWLSAMQ GTLLLSAIAH FGLLGAWLPQ LGHYQIRVAD ITALLSFLFL LQFTRSFFDQ EESPRLLDWL LRGEIAVLAL AAVLDGASPH DMLRSGLILL ASLSATGCAL YHWLHGYRPA RLALVGTLLF HLGFSAYLSM LLDFGQFGPC WQIFALFAFT AASGLLLVFA LAERQCKIER DRAAWRTSEA ASSAELRAKS EFLAKLSHEI RTPMNGVLGM TELLLGTPLT AKQYDYVQTI NSSGNELLSL INDILDISRL ESGQIELDDV QFDLNGLIED CLGIFRAKAE RRHVELISFI QPQVPRVING DPTRLRQILL TLIENAFRQT DEGEILLKVS LETEGPRPNL HVAVQDSGEP MPESRREAML NSRVQSSDLL SAAQIDGSLD LIIARQLITL MHGICGIDAD PTRGTTLWLS LPLDPARLEH PVADLDSPLQ DARLLVVDDN DTCRKVLLQQ CGAWGMKVSA VSSGKEALAL LRTKADLGEY FDAALLDQNM PGMSGLQLAQ RIKEDPGLNH DILLVMLTGL SDAPGRVYAR NAGITCSLAK PVAGYTLKTT LAEELGKRRA ARPAASGLEE NAPLEVPSDF RILVAEDNSI STKVIRGMLR KLNLQPDTAS NGEEALQAMK AKQYDLVLMD CEMPVLDGFA ATEQLRAWEA SELRPHTPIV ALTAHILSEH REHAQRVGMD GHLAKPVELS QLRALIEHWV ERKEESYRHA LHP
|
| |