Gene GSU1990 details

Gene Information

Locus tag: GSU1990
Symbol:
ID: 2686138
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 2183375
End bp: 2184382
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 56%
IMG OID: 637126681
Product: sensor histidine kinase
Protein accession: NP_953039
Protein GI: 39997088
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
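
The coordinate and length fields above are related by simple arithmetic. The following is a minimal Python sketch of that relationship, not part of the original record. It assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon. The function names (gene_length, protein_length, gc_fraction) are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS that ends in a stop codon.

    Assumption: the stop codon is included in the gene length but is
    not translated, so one codon is subtracted.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Start bp and End bp as listed in the record.
length = gene_length(2183375, 2184382)
print(length)                  # 1008
print(protein_length(length))  # 335
```

Passing the gene sequence from the Sequence section to gc_fraction (the spaces and line breaks between 10-base blocks are ignored) gives a value that can be compared with the listed GC content.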


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.102849
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTATGA GACGATACTC ATTTATGGTC GACGATCAAC TGCGGCTCGT GGCACGAGAC 
GATCTGCACC GTTCGGTGGG GAACGATCCC TTGTTGGGTA TTCCCTATTA CGAATGCTAT
CCCCGGCTTT GGAACGGCAA CGTTGATGCC GTTCGTACCG TCGTGGATCA GGGAGAGCCT
CTGGTTCTCA ACGGTTATCG CCACTACTGC TTTTATGGCG AGTTCAGTGC GGCAGACATT
GAAATTCGCC CCGTTTTCGA TGGTGCCTCC CGATGCACCG GCGCAATGGT GCAGGTTTCC
GTGCTCCCTG GATGTACGGT CTATGGTGAA ATGGACAGCG CCCGGGCGCT GATAGACATC
GGCAAGATGT CGGTGACGCT TGCCCATGGC GTACGCAACC CGCTCAATGC CATCAAGGGC
GCCGTGGTCT ATCTCAAGGA CAAGTACTGC ACTGACGCAA CCTTTGCCGA ATTCGCCGAT
ATTATCGACG AAGAGATTTG CAAGCTCGAC GGGTTCATTA CCGAGTTCCT CGGTACCTCG
CACCTGGAAC CGGTCAGAGA GGAAATCCAG CTCAACGATC TCCTTGAGCG CGTCGTAAAA
TTCGTTTCCC TGCAGGCAGA TGCGAATCAT GTGCGATTTG ACGTGGAGTA CGGTGAGTTG
CCGCTCGTCA TGCTCGATTC GTTCAATTTC GGTCACGCCA TACTCAATAT CGTCAACAAC
GCTCTCGGGG CCATGGAGGC GGGAGGCTCC CTTACCATGC GAACCAGCAC ATTGCTGGAA
GGTGGAGTGG AGATGATTGT GGTCGAGGTG GCCGATACGG GGTCGGGCCT TCGGTCGGGG
GGCAAGGGGA TGCTTGGCAG TTTTCCCGGA GCCGGCAGCA GGAAGCAGGG GAAAGGGTAC
GGCCTGTTCA TCACCCGTGA GATTGTACGC CACCACCAGG GGAAGATTGA AATAACCGGG
AATCATGAAG GCGGAACCAC CGTCAAGATC ATGTTGCCGG CGGTATAG
 
Protein sequence
MVMRRYSFMV DDQLRLVARD DLHRSVGNDP LLGIPYYECY PRLWNGNVDA VRTVVDQGEP 
LVLNGYRHYC FYGEFSAADI EIRPVFDGAS RCTGAMVQVS VLPGCTVYGE MDSARALIDI
GKMSVTLAHG VRNPLNAIKG AVVYLKDKYC TDATFAEFAD IIDEEICKLD GFITEFLGTS
HLEPVREEIQ LNDLLERVVK FVSLQADANH VRFDVEYGEL PLVMLDSFNF GHAILNIVNN
ALGAMEAGGS LTMRTSTLLE GGVEMIVVEV ADTGSGLRSG GKGMLGSFPG AGSRKQGKGY
GLFITREIVR HHQGKIEITG NHEGGTTVKI MLPAV