Gene GSU1043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1043 
Symbol 
ID2686972 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1129626 
End bp1131929 
Gene Length2304 bp 
Protein Length767 aa 
Translation table11 
GC content60% 
IMG OID637125712 
Productsensory box histidine kinase 
Protein accessionNP_952096 
Protein GI39996145 
COG category[T] Signal transduction mechanisms 
COG ID[COG2202] FOG: PAS/PAC domain
[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.382651 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCGAGA ACAGGAACTC GTCTGAAACC GTACCGCTCC TCGTCATAGG AACCGGAATC 
ATCGTCGCGC TCTACCTGAC GAGCTTTTAC AGCTTCGTGC TGTTCCACGT CCTAGTTGAA
TTCTTCTCCG TGGTCGTGTC GGGCGCGATC TTCGTCATCG CCTGGAACTC GCGCCGCTTT
GCCGCCAACG GGTACCTGCT GGTCATCGGC ATCGCCCACC TCTGCGTCGG CAGCATCGAC
CTGCTGCACA CCATCGCCTA CAAGGGCATG GGGCTCTTCC CCGGCTTCGA CGCCAACCTC
CCCACCCAAC TCTGGATAGC CGGGCGCTAC CTGCAGAGCG CCTCGTTCCT CCTGGCACCT
TTCTTCATCG GCCGCCGGCT CCTGTCGCGC ACCCTGCTGA CTGCATACCT GGCCGTCACC
TCGCTGCTGC TCGCCGCCAT CTTTGTCTGG CAGATCTTTC CCGTCTGCTT CGTGGAGGGA
GTGGGGCTTA CCCCCTTCAA AAGGGTCAGC GAATACATCA TCTCCGCAAC ATTCGTGGCG
GCCCTGGTCC TGTTGCGAGT CAAGCGCAGG GCGTTCGACC GCCGGGTCGC CAATTATCTG
TCGGGAGCCA TTGCTCTCAT GGTGGGCGCC GAACTGGCCT TCACCTTCTA CATCGACGTC
TACGGCCTGA GCAATCTGGT CGGCCACCTG TTCAAGGCGG CGGCCGTGCT GCTCATCTAC
CGGGGGCTGG TCGAGACGGC CCTGACCCGC CCCTACGATA TCCTGTTCCG GGAACTGACG
GAACGGGAAG AGGCCGTGCG GGAGTCCCAC CAGCGCATCT CCACCATCCT CGAAAGCATT
ACCGATGCGT TCTTTTCACT GGATCACTCA TGGCGCTTCA CCTACCTGAA CGGGGAGGCG
GAGCGATTGC TCGGGCGCAC CCGAGACGAC CTGCTGGGAA AGAGTATCTG GGAGGAATTC
GCCGCCGCCG TGGGCACAAG GTTTGACGTC GAGTACCGTC GCGCCGTGGC AAGCGGAACG
ACTGCCACGT TCGAGGAGTA CTACCCTCCC CTCGCCTGCT GGTTCGAAGT CCACGCCTAC
CCCTCGCGGG ATGGGCTCTC GGTCTTTTTC CAGGACATTA CCACCCGTAA GCGGGCCGAA
GCCGAGCTCC GCGCTTCGGA AGAGCGCTTT GCAAAGGCAT TCAACACCGC GCCCACCATC
ATGATCATCG CATCCCTTGT CGACGGACGG TACCTGGAGG TAAACGGAGC GTTCGAAAAA
ACCCTTGGCT GGCGGAAGGA AGAGGTGCTC GGCCGCACAT CGTACGACCT CGGCATCTGG
ATTGACACCG CCGAACGCGA AAACATCCTG CGCGAGGTTG CCGAACAGGG ATCGGTCCAT
GACCGGGAAA TCCAGTTCCG CAGCAGGAAC GGTGAAACGA TCATCGGCCT TTACTCGGGC
GTCATCATTG AACTGAACGG CAAGCAGTGC CTGTTAAGCA TTGTACGGAA CATTACTGCC
CGCAAGCGGG CCGAGCAGCA GATCGTCATC CTCAACAAGG AATTGGAGGA CCGGGCGGAG
GTGCTGGAGG AAACCAACTG CGAACTGGAG ACGAGTGTCG AGCAACTGGA AGCGGTAAAC
CGCGAACTTG AGGACGCCAA CGAAGAGCTG GAGGCGTTCA ACTATTCAGT CTCTCATGAC
CTGCGCCGTC CCCTCACCAA CATCAACGGG TTCTCCCAGC TCATCCTGGA GTTGTACGGC
CGTCAACTGG AGGCACAGTG CCGCGATTTC GTCCGCTGCA TTTACGACGA AACGCGCAAC
ATGGATCGCC TTATCGGGAC ACTGCTCAAC TTTTCCCGGA TCTCGCGCTG TGCGATGAAG
CCCGAAACGG TCGACCTGAG CACCATGGCC CGGCAGATAA CCGAAACCCT GAAAATAGGC
GAGCCGGAGC GCAAGGTCAG CTTCCACCTG ACCGAGGGAA TAACGGCCCG GGGAGACGCG
GGGCTGCTCC ATATCGTACT CGACAACCTG GTGGGAAATG CGTGGAAATA CAGCAGCAAG
CAGGAAGACG CCCGCATCGA GTTCGGCGTG ACCGACAGGA GCGGCACCGT CGCCTACTTC
GTGCGCGACA ACGGAGCCGG CTTTGACATG GAGTTCGCGC AACAGCTGTT CACGCCGTTC
CAGAGGCTCC ATCACAGCGA GGATTTCGAA GGGCACGGCA TCGGCCTTGC CACGGTGGCG
CGGATCATCC AGCGCCACGG CGGAAAGGTC TGGGCGGAAG GAGAGGTAGG ACAGGGAGCT
ACCTTTTACT TCACCCTCTG CTGA
 
Protein sequence
MIENRNSSET VPLLVIGTGI IVALYLTSFY SFVLFHVLVE FFSVVVSGAI FVIAWNSRRF 
AANGYLLVIG IAHLCVGSID LLHTIAYKGM GLFPGFDANL PTQLWIAGRY LQSASFLLAP
FFIGRRLLSR TLLTAYLAVT SLLLAAIFVW QIFPVCFVEG VGLTPFKRVS EYIISATFVA
ALVLLRVKRR AFDRRVANYL SGAIALMVGA ELAFTFYIDV YGLSNLVGHL FKAAAVLLIY
RGLVETALTR PYDILFRELT EREEAVRESH QRISTILESI TDAFFSLDHS WRFTYLNGEA
ERLLGRTRDD LLGKSIWEEF AAAVGTRFDV EYRRAVASGT TATFEEYYPP LACWFEVHAY
PSRDGLSVFF QDITTRKRAE AELRASEERF AKAFNTAPTI MIIASLVDGR YLEVNGAFEK
TLGWRKEEVL GRTSYDLGIW IDTAERENIL REVAEQGSVH DREIQFRSRN GETIIGLYSG
VIIELNGKQC LLSIVRNITA RKRAEQQIVI LNKELEDRAE VLEETNCELE TSVEQLEAVN
RELEDANEEL EAFNYSVSHD LRRPLTNING FSQLILELYG RQLEAQCRDF VRCIYDETRN
MDRLIGTLLN FSRISRCAMK PETVDLSTMA RQITETLKIG EPERKVSFHL TEGITARGDA
GLLHIVLDNL VGNAWKYSSK QEDARIEFGV TDRSGTVAYF VRDNGAGFDM EFAQQLFTPF
QRLHHSEDFE GHGIGLATVA RIIQRHGGKV WAEGEVGQGA TFYFTLC