Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU0475 |
Symbol | |
ID | 2686122 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 506171 |
End bp | 508039 |
Gene Length | 1869 bp |
Protein Length | 622 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637125142 |
Product | sensory box histidine kinase |
Protein accession | NP_951534 |
Protein GI | 39995583 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCAGCT CCGACCTCGT ATCGCGAAGC GCACGCTTCG CCCTGGTGAT GCTGTTCCTC GCCGCGCTCG TCCTGACGAG CCGCGTCAAC TATCTCTTAT TTCATACGCT TACCGAAATC GTCACCGTAG TCGCGGGATG CGGCATCTTC ATGGTCGCTT GGCACGCCCG GCGCCAGATC GATAACCACT GCATTCTGCT CATCGGCATC TCCCACCTGT TCGTGGCAAT CATTACCCTG TTCCACGCCC TGAGCTACCG GGGGATGGGG GTCTTCCCCG GCGCGGGCAC AAACCTCTCC ACCCAGCTCT GGATCGGCTC GCGACTGCTG CAGGGAGCGT CGCTGGCGCT TGCACCGCTG TTCGTCAGAA AGCGGCTCAA TCCCTCTGTT ACCGTTACCG CTTACATGGC CGCAACGTCA CTTTTCCTGC TGTCAATCCT GTGCGGGGAT ATCTTCCCCC TCTGCTTCTC GGACGAGGTG GGGCTCACCC CGTTCAAAAA GGGCGCCGAG TTCCTGGCTG GGATATTCAT TCTCGCTGCC CTGACAGGCT TGCTGACCAA GCGCCGGCAC TTCGACGCCC GAGTGTTCCG GATGCTTTCG CTCTCCCTGG TCTTTCTTGC AACGAGCGGC TTCGTTTTTG CTCTCAACAC CGGCATCTCC AGCCTCACCG GCATGACGGG ACATCTGCTT ACGCTCGGCG GCTTCATACT CATCTACCGC GCCATGGTGG AAACGGGGCT GGAACGCCCC TACGATCTGC TGTTCCGTGA CCTGAAAGAA AGCGAAGAGC GCTACCGCAG CCTCTACAAC AGGACTCCCG TCATGCTCCA CTCCATTGAC CGGGAGGGAA AAATCGTCAA TGTCAGCGAT TTTTGGCTGG AAACCCTCGG CTACCGGCGC GACGAGGTGC TGGGCAGGCT TTCCGCCGAT TTCATGACTG ACGAGTCACG GTCGTACGTG ATCGGGACGG TCGTCCCGGA ATTCCTCCGT ACCGGACGCA CCAGAGATAT CCCGCTCCAT CTGCTGACCA GCAGCGGAAA GGTCATCGAC GTTCTCCTTT CGTCTGAGGC GGAACGAGAT GAAGAGGGAG AAATCGTGCG TTCCCTGTCG GTCATGACCG ACGTGACGGA GCAGCGGCGT GCGGCCCGGC AGATCGAGCG ACTCAACGAA AGTCTCGCCT CCCGGGCCAT GGACCTGGAA GTGGCCAATG GCGACCTGGA GGCGTTCAAC TACAGCGTCT CGCACGATCT GAGATCGCAC CTGACCGTGA TCCGGGGCTT CAGCGACGTT CTGCTCGAGA TCTGCACAGA CAAACTCGAC GATGAATGCC GCAGCTACGT GCGTCACATC GGGGAAGAGA CGGGGCGCAT GAACGGACTC ATCGGCACCC TGCTCGACTT TTCCCGCGTG GCCCGCGTAG AACTGGAACG GGTACCGGTC AACCTGAGCA CGCTGGCAGA GGAAATTGCC CTGGAGCTCA GGATGAAGGA CCAGGAGCGC ATGGCAGAGT TCATCATTAT CGATGACGCC GACGTGACCG CCGATCCGGG GCTCATGAGG GTTGTGATGG AGAATCTGCT GGGCAATGCC TGGAAATACA CGGGCAGGCG GGAACAGGCC GTAATCGAGT TCGGCAAGGA AGAGATGGAG GGTCAAACCG TGTTTTTCGT CCGGGACAAC GGAGCGGGGT TCTCGTCACA ACAGGCCGAC AAACTCTTTC TTCCCTTCCA GCGCCTCCAC GGCCGGAGCG AATTCCCGGG GCACGGCATC GGCCTCGCAA CGGTCCACAG GATCATCTCC CGCCATGGTG GCACAATCTG GGCCCAAGGG GAAGAGGGCG CCGGAGCCGT ATTCTATTTT ACGCTGTAA
|
Protein sequence | MTSSDLVSRS ARFALVMLFL AALVLTSRVN YLLFHTLTEI VTVVAGCGIF MVAWHARRQI DNHCILLIGI SHLFVAIITL FHALSYRGMG VFPGAGTNLS TQLWIGSRLL QGASLALAPL FVRKRLNPSV TVTAYMAATS LFLLSILCGD IFPLCFSDEV GLTPFKKGAE FLAGIFILAA LTGLLTKRRH FDARVFRMLS LSLVFLATSG FVFALNTGIS SLTGMTGHLL TLGGFILIYR AMVETGLERP YDLLFRDLKE SEERYRSLYN RTPVMLHSID REGKIVNVSD FWLETLGYRR DEVLGRLSAD FMTDESRSYV IGTVVPEFLR TGRTRDIPLH LLTSSGKVID VLLSSEAERD EEGEIVRSLS VMTDVTEQRR AARQIERLNE SLASRAMDLE VANGDLEAFN YSVSHDLRSH LTVIRGFSDV LLEICTDKLD DECRSYVRHI GEETGRMNGL IGTLLDFSRV ARVELERVPV NLSTLAEEIA LELRMKDQER MAEFIIIDDA DVTADPGLMR VVMENLLGNA WKYTGRREQA VIEFGKEEME GQTVFFVRDN GAGFSSQQAD KLFLPFQRLH GRSEFPGHGI GLATVHRIIS RHGGTIWAQG EEGAGAVFYF TL
|
| |