Gene GSU0475 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0475 
Symbol 
ID2686122 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp506171 
End bp508039 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content60% 
IMG OID637125142 
Productsensory box histidine kinase 
Protein accessionNP_951534 
Protein GI39995583 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCAGCT CCGACCTCGT ATCGCGAAGC GCACGCTTCG CCCTGGTGAT GCTGTTCCTC 
GCCGCGCTCG TCCTGACGAG CCGCGTCAAC TATCTCTTAT TTCATACGCT TACCGAAATC
GTCACCGTAG TCGCGGGATG CGGCATCTTC ATGGTCGCTT GGCACGCCCG GCGCCAGATC
GATAACCACT GCATTCTGCT CATCGGCATC TCCCACCTGT TCGTGGCAAT CATTACCCTG
TTCCACGCCC TGAGCTACCG GGGGATGGGG GTCTTCCCCG GCGCGGGCAC AAACCTCTCC
ACCCAGCTCT GGATCGGCTC GCGACTGCTG CAGGGAGCGT CGCTGGCGCT TGCACCGCTG
TTCGTCAGAA AGCGGCTCAA TCCCTCTGTT ACCGTTACCG CTTACATGGC CGCAACGTCA
CTTTTCCTGC TGTCAATCCT GTGCGGGGAT ATCTTCCCCC TCTGCTTCTC GGACGAGGTG
GGGCTCACCC CGTTCAAAAA GGGCGCCGAG TTCCTGGCTG GGATATTCAT TCTCGCTGCC
CTGACAGGCT TGCTGACCAA GCGCCGGCAC TTCGACGCCC GAGTGTTCCG GATGCTTTCG
CTCTCCCTGG TCTTTCTTGC AACGAGCGGC TTCGTTTTTG CTCTCAACAC CGGCATCTCC
AGCCTCACCG GCATGACGGG ACATCTGCTT ACGCTCGGCG GCTTCATACT CATCTACCGC
GCCATGGTGG AAACGGGGCT GGAACGCCCC TACGATCTGC TGTTCCGTGA CCTGAAAGAA
AGCGAAGAGC GCTACCGCAG CCTCTACAAC AGGACTCCCG TCATGCTCCA CTCCATTGAC
CGGGAGGGAA AAATCGTCAA TGTCAGCGAT TTTTGGCTGG AAACCCTCGG CTACCGGCGC
GACGAGGTGC TGGGCAGGCT TTCCGCCGAT TTCATGACTG ACGAGTCACG GTCGTACGTG
ATCGGGACGG TCGTCCCGGA ATTCCTCCGT ACCGGACGCA CCAGAGATAT CCCGCTCCAT
CTGCTGACCA GCAGCGGAAA GGTCATCGAC GTTCTCCTTT CGTCTGAGGC GGAACGAGAT
GAAGAGGGAG AAATCGTGCG TTCCCTGTCG GTCATGACCG ACGTGACGGA GCAGCGGCGT
GCGGCCCGGC AGATCGAGCG ACTCAACGAA AGTCTCGCCT CCCGGGCCAT GGACCTGGAA
GTGGCCAATG GCGACCTGGA GGCGTTCAAC TACAGCGTCT CGCACGATCT GAGATCGCAC
CTGACCGTGA TCCGGGGCTT CAGCGACGTT CTGCTCGAGA TCTGCACAGA CAAACTCGAC
GATGAATGCC GCAGCTACGT GCGTCACATC GGGGAAGAGA CGGGGCGCAT GAACGGACTC
ATCGGCACCC TGCTCGACTT TTCCCGCGTG GCCCGCGTAG AACTGGAACG GGTACCGGTC
AACCTGAGCA CGCTGGCAGA GGAAATTGCC CTGGAGCTCA GGATGAAGGA CCAGGAGCGC
ATGGCAGAGT TCATCATTAT CGATGACGCC GACGTGACCG CCGATCCGGG GCTCATGAGG
GTTGTGATGG AGAATCTGCT GGGCAATGCC TGGAAATACA CGGGCAGGCG GGAACAGGCC
GTAATCGAGT TCGGCAAGGA AGAGATGGAG GGTCAAACCG TGTTTTTCGT CCGGGACAAC
GGAGCGGGGT TCTCGTCACA ACAGGCCGAC AAACTCTTTC TTCCCTTCCA GCGCCTCCAC
GGCCGGAGCG AATTCCCGGG GCACGGCATC GGCCTCGCAA CGGTCCACAG GATCATCTCC
CGCCATGGTG GCACAATCTG GGCCCAAGGG GAAGAGGGCG CCGGAGCCGT ATTCTATTTT
ACGCTGTAA
 
Protein sequence
MTSSDLVSRS ARFALVMLFL AALVLTSRVN YLLFHTLTEI VTVVAGCGIF MVAWHARRQI 
DNHCILLIGI SHLFVAIITL FHALSYRGMG VFPGAGTNLS TQLWIGSRLL QGASLALAPL
FVRKRLNPSV TVTAYMAATS LFLLSILCGD IFPLCFSDEV GLTPFKKGAE FLAGIFILAA
LTGLLTKRRH FDARVFRMLS LSLVFLATSG FVFALNTGIS SLTGMTGHLL TLGGFILIYR
AMVETGLERP YDLLFRDLKE SEERYRSLYN RTPVMLHSID REGKIVNVSD FWLETLGYRR
DEVLGRLSAD FMTDESRSYV IGTVVPEFLR TGRTRDIPLH LLTSSGKVID VLLSSEAERD
EEGEIVRSLS VMTDVTEQRR AARQIERLNE SLASRAMDLE VANGDLEAFN YSVSHDLRSH
LTVIRGFSDV LLEICTDKLD DECRSYVRHI GEETGRMNGL IGTLLDFSRV ARVELERVPV
NLSTLAEEIA LELRMKDQER MAEFIIIDDA DVTADPGLMR VVMENLLGNA WKYTGRREQA
VIEFGKEEME GQTVFFVRDN GAGFSSQQAD KLFLPFQRLH GRSEFPGHGI GLATVHRIIS
RHGGTIWAQG EEGAGAVFYF TL