Gene GSU0303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0303 
Symbol 
ID2686969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp333197 
End bp334414 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content59% 
IMG OID637124969 
Productsensory box protein 
Protein accessionNP_951363 
Protein GI39995412 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCTGGT TTGCCAACCT GAAAATCACC CCAAAATTCA TAGTTATTCT TTGTGTCCTG 
TTTGTAGCTC TCATGGGTGT CAATGCCGTT GACGATTATC TGCGGCAGGA GTCGTTGATC
GTCAAGGATG CCACCGATAA TGCCCGTATC CTTGCCCGGC AGATCGTTGA AACCCGGGAA
TATATGTCTT CGGTGGTCCG GGGCGAACCG GAGGCGAACT ACAACCTCGT CCCCCAGGTG
GTCGCTACCC AGGTGGCCAA GCGGATCACG ACCGGCAGCA AGTACTACGT CCGTCAGGTG
TCGTTACGCT ACCGCAATCC CGCCAACCGC CCCGATGCCT ATGAAACGGC GATGCTTCAG
CGGTTCGGCA GGGAAAAGGC TGCCGAGCAG TGGGAGGTGG TAACGATCGA CGGCAAACGG
GTCTTCCGCT ATCTGCTGCC CATGACTGCC GACGCCTCAT GTCTCGGTTG CCACGGCCGC
TACGAGGAGG CCCCGGCCTT CGTCCAGGCG CGGTTCCCCA AGGGGCACCC CTCCTATGAT
TACCAGATCG GCCAACTGAT CGGCGCCGTG TCAGTTTCCA TTCCCATGGC GGATCTCTAT
CGCCAGATCG GTATCAACCT CAAGCTCGAT CTGGCAATCG TCTCGGGCGT AGTCCTCCTC
ATCCTCGCGG CCATGAGCTT TATGGTCCAC CGCTCCATTA TCAAGCCGGT CCGGTCGGTG
GCAACCTCCA TTGGCAACGT GGCCGCTACG GGGAATTTTT CGGAACGGCT CAGGAGAAGT
TCCAACGACG AGGTGGGAGA GCTGGTGGGA GCGTTCAACG AACTCATGGA GGAGTTGGAT
CGCACGACGC GTCAACGGCA GGAGTCTGAA GACCGCTACC GCAACGTGCT GGAGATGGCC
CAGTCCGCCA TTGTCACCTT TTTGGCCGAC GGGAAGCTCA TCATTACCAA CCGCAAGGCA
GAGGACCTGT TCGGCCTTCC CAAGGGAGAG CTGCTGGGTG TCTCCATCTA CACCTTCATG
GAGGAGGGGG ATGCGGTAAA GGGCGCCATC GAGACCTATC TGAGGGAAGG GGCGGCAGCC
ATGATCGGGG AGACGACCCC GCAGCGGATT CGCAGCATCC GCGGGAATGT GACAACGGTG
GAAATGGCCC TCTCGGTGTC CCGGTCGGAC CACAACCCCC TCTTTACCGC TATTTTGCGA
GAAAACGGCC GGTCCTAG
 
Protein sequence
MSWFANLKIT PKFIVILCVL FVALMGVNAV DDYLRQESLI VKDATDNARI LARQIVETRE 
YMSSVVRGEP EANYNLVPQV VATQVAKRIT TGSKYYVRQV SLRYRNPANR PDAYETAMLQ
RFGREKAAEQ WEVVTIDGKR VFRYLLPMTA DASCLGCHGR YEEAPAFVQA RFPKGHPSYD
YQIGQLIGAV SVSIPMADLY RQIGINLKLD LAIVSGVVLL ILAAMSFMVH RSIIKPVRSV
ATSIGNVAAT GNFSERLRRS SNDEVGELVG AFNELMEELD RTTRQRQESE DRYRNVLEMA
QSAIVTFLAD GKLIITNRKA EDLFGLPKGE LLGVSIYTFM EEGDAVKGAI ETYLREGAAA
MIGETTPQRI RSIRGNVTTV EMALSVSRSD HNPLFTAILR ENGRS