Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plut_0942 |
Symbol | |
ID | 3744280 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium luteolum DSM 273 |
Kingdom | Bacteria |
Replicon accession | NC_007512 |
Strand | + |
Start bp | 1061807 |
End bp | 1064836 |
Gene Length | 3030 bp |
Protein Length | 1009 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637768976 |
Product | PAS/PAC sensor signal transduction histidine kinase |
Protein accession | YP_374847 |
Protein GI | 78186804 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.808539 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.370393 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAAAA AGACGTTCAT CCCTGATGCA TCGGCAATCG ACACCATCAT CGATCTCGTT CCGCACGAAG CTGCCGTCAT AGACCCTCAC GGCTGCATTC TCCATTCCAA TGGCCTCTTT CAGAGAGGGC TTGGCTCGGG AAGCCCTCTG TCTTTCGGCC GGAACCTTTT TGATATTATC TCTTCCAGCC CGCTCCGCAC CATCAATACT GAAGAGCTTT CCGTGCTCTG CCGGAACGTT CTGGAAACGG GGCTTCCTTC GGGCAACATC CGTCCGCTCA GCTCTCCGGA CGGCGCCGTT CAGGCACTGG TCCTCACCTT TCCCGACGAT GGTGCCGGAG GCAGAACGCC GGCCGATGCC AACCAGAACA TCTCCCGGGC ACTGAACGCA GCCGATGCCG GCACCTGGCA GTGGAATCTG AAGTCGGGAG CCATTCAATG GTCGGAAGGG CTGTGGCGTC TTTTTGAAAT GCAGAACACG GGCGAGCCGC CCTCGCTGGC GATATGGGAG GAGCGGCTCG ACCCGACGAA CCGCGACTCG ACCTTGCAGG CCTTTGCAGC CTGCAGCTCC TCCGCCATGC TCATGCATGT CGAGTATTGC ATCATTCTCG ACGACGGGAG CCGGCGGTGG GTCATGAGCC GCGGCAGGCC CTATCACGAC AGCCGGGGAG CGATGGAGGG CTACATCGGC ATCACGATTG ACATAACCGA ACAGAAAAAG CTTGAAGCAG AGCTGCGTGA CAACAAGGCA CGGTTTTCCT TTGCGCTCGG GGCTGCCTGT TCGGGCATCT GGGAGTGGAA TATTGAAACC GATGAACTGC TGTGGTCCGA AGAGGTATGG CGGCTGTACG GACTTTCTCC CGGCTCCGAG ACTTTGAACC ATCAGCTTTG TGTGTCGACG GTGCATGAAG AGGACCGCGC GATAGTCAGC CATGTGATCA AGGATGCGGT GACGGGCGGC AGGGATGCTT CGGTGGAGTT CCGGGTGAAA TATCCGGGCG ATGGTTCCCT GCACTGGCTC CAGTCCAAGG GTTCGCCGAA ACGAGATGCT GACGGAACTG TGGTGAAGTA CATCGGACTC ATTACCGACA TCACCGAGCG CAAGCTTGCG GAAATAGAAC TGATCGAAAA CCGGAGCCGG CTCGAGCAGG CTCTGGCTGC GGCGAAGGCC GGCATCTGGG AGTGGGATCT CCTTACTGGA CGCAATACCT GGTCGGATGA GGTATGGGCC CTTTACGGAC TTGAGCCGCA CTCCCGTGAG CCGTCTTTTG CCGTATGGGC GGAAACGCTC CATCCCGAGG ACAGGGATGC AGCCTGCCTC TCGGTGCAGT CCGCTTCGGC TGAAGGGAGG GAGATTTCCT TCGAATACCG GGTTCCGAGG GCGGACGGCT CGATGCTCTG GCTGCTTTCG AGGGGGCAGC CCCGCAGGGA TGCCTCCGGA GCCATCGTCA AATACCTCGG GACGGTCATC GACGTCACCG AACAGAAATC GCTTGAGGAG CAGCTGCTCA AAAGCAAGGC CCGCATGGCT TTTGCCCTTG AGGCCACGAA AGCGGGCGTA TGGGAGTGGG ATCTGAAGCA TGATCAGGTG TTCTGGTCTG ACCGGATCTG GCAGCTCTAC GGCCTTGAGT CCGGCAGCAA ACCCAACAAT CACAAGCTCT GCGAAAGCAA TGTCCTGCCG GAGGACAGGG AGCAGACGTT CGGTATCGTC ATGCAGGCGG CCAATCGTGA AATCGAGATC GATATCGAGT TCCGGGTATG CCACCCTGAG GACGGTACCA TCCACTGGCT GGCCTGCAGG GGCAAACCAC AGCTTGATGC CCAGGGCGCT TTAGACCGGT ATGTCGGCAC CATCATGGAC ATCACCGACC GCAAACGGCT CGATGACGAG CTGCGGGAAA ATGAACGGAA ATTCAGGGGC ATTTTCGACA ATGCTCCGAT AGCCATCAGC ATCAAGGAGA TCGATACGGG CAAGATCATT GATGTCAATG ATGCCTGGCT CAACCTGCTC GGCTACACTC TTGTTGAGGT ACTCGGCCGC ACAGGACCCG ATATCGGCAT CCATGCCTGC AGGGAGGACT ATCAGGCCAT TGAGATGGCC TCCCTGAAGC GCGAACGGAT ATGCAACCGC CAGGTCCTTC TGCACGAAAA GAACGGTGAC CTGGTCGATG TGCTCTACTC CACGGAATTC ATCGAGTTCG AGGGCATGGC GGTCATGCTG GTCATGATGG TCGATGTCAC GCTTGAGAAA ATGCAGCAGC AGAACATCAG CCGCCTTGAG CAGTCGATTG CCGACCGCAA TATCCAGCTC CAGAAAGAGG TGGAGCGGCT GCACCGGTTC CTGAGCATGA TCTCCCATGA ATACCGGACG CCCCTGGCCA TCATGAGGAC CAATCTCGAC CTCGTCAAGA TGAAGAACAA GATGGGGAAT TTCCAGAACC GTCAGGAGTT TGCAAAGATT GACCGGAGCA TTGCCCGCCT GGTCGAGGTG CTTGAGGTCT CCATCCAGGA AAGCCGGATG GCCGACCGTC ATAAATCGGT GACGGGCAGG CACGAACTCC CGCTTGCCGG CATCATCAAA TCCCAGGTCG AAGCGTTTTC CGGGATATGG AAAGAAAGAA GCGTCTTGTT TGAGAACTGC CTGGAGGATG TGCGGGTATA CGGGGAGGAG TCCGGGGTCA AATTTGCCAT TTTCAACCTG CTTGACAATG CCCGGAAGTA TTCACCCGAT GAGTCGCCAA TCAGCATCAC CTGCCGGAAG GCCGGGCCGG GTCATATCAT GATTGATGTC GGCAACGAGC TGCGCGAACC CCTTGAAGAC GTGGACATGG ACCTGTTCTT CGAGAAATAT CATCGGGGAG GGAACTCCAG CAATACGGCA GGGGCGGGTC TGGGGCTCTG GCTCGTGAAG AACATCGTCA CCCAGCACGA GGGACATATT GCGCTTTCCA AACAGGCCTC AGGGATTGTC GTGACGATCA CGCTTCCCGT CATTGCATCC TTAGGCGGAG ACAGACAGGC GAAACCATGA
|
Protein sequence | MAKKTFIPDA SAIDTIIDLV PHEAAVIDPH GCILHSNGLF QRGLGSGSPL SFGRNLFDII SSSPLRTINT EELSVLCRNV LETGLPSGNI RPLSSPDGAV QALVLTFPDD GAGGRTPADA NQNISRALNA ADAGTWQWNL KSGAIQWSEG LWRLFEMQNT GEPPSLAIWE ERLDPTNRDS TLQAFAACSS SAMLMHVEYC IILDDGSRRW VMSRGRPYHD SRGAMEGYIG ITIDITEQKK LEAELRDNKA RFSFALGAAC SGIWEWNIET DELLWSEEVW RLYGLSPGSE TLNHQLCVST VHEEDRAIVS HVIKDAVTGG RDASVEFRVK YPGDGSLHWL QSKGSPKRDA DGTVVKYIGL ITDITERKLA EIELIENRSR LEQALAAAKA GIWEWDLLTG RNTWSDEVWA LYGLEPHSRE PSFAVWAETL HPEDRDAACL SVQSASAEGR EISFEYRVPR ADGSMLWLLS RGQPRRDASG AIVKYLGTVI DVTEQKSLEE QLLKSKARMA FALEATKAGV WEWDLKHDQV FWSDRIWQLY GLESGSKPNN HKLCESNVLP EDREQTFGIV MQAANREIEI DIEFRVCHPE DGTIHWLACR GKPQLDAQGA LDRYVGTIMD ITDRKRLDDE LRENERKFRG IFDNAPIAIS IKEIDTGKII DVNDAWLNLL GYTLVEVLGR TGPDIGIHAC REDYQAIEMA SLKRERICNR QVLLHEKNGD LVDVLYSTEF IEFEGMAVML VMMVDVTLEK MQQQNISRLE QSIADRNIQL QKEVERLHRF LSMISHEYRT PLAIMRTNLD LVKMKNKMGN FQNRQEFAKI DRSIARLVEV LEVSIQESRM ADRHKSVTGR HELPLAGIIK SQVEAFSGIW KERSVLFENC LEDVRVYGEE SGVKFAIFNL LDNARKYSPD ESPISITCRK AGPGHIMIDV GNELREPLED VDMDLFFEKY HRGGNSSNTA GAGLGLWLVK NIVTQHEGHI ALSKQASGIV VTITLPVIAS LGGDRQAKP
|
| |