Gene GSU3230 details


Gene Information

Locus tag: GSU3230
Symbol:
ID: 2688311
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 3544670
End bp: 3546226
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 11
GC content: 62%
IMG OID: 637127923
Product: sensory box histidine kinase
Protein accession: NP_954271
Protein GI: 39998320
COG category: [T] Signal transduction mechanisms
COG ID: [COG4585] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
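
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3544670
end_bp = 3546226

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# One codon per three bases; the last codon is assumed to be the stop codon
# and therefore does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)  # 1557 0 518
```

The results match the Gene Length (1557 bp) and Protein Length (518 aa) fields, and the zero remainder shows that the gene length is a whole number of codons.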


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGGTTG ATTCGGTGTT ATTCGCCGTC ACCCCCCGCC CGGGGGATAC TCCAAACCTG 
CGGAGGTATC CGCTCATGCT CGCGCTTCAG GACGACTCCC TTTTTGCCGC CATTGTCGAT
AATCTCTCCG ACGGTATCTA TGTCCTCCAG GACGACCGGG TGGTCTATCT GAATGAGCGT
TTTGCGCAGC TGTTCGGGCA TGGTGATGCA GGGTCCCTTC TCATGAGGGA GCTTGATGAC
ATCCTGCCCC ACGGCGAGGG CAAGGAGATC GTCGGCAGAA TCCACGCGGA GCTGCTGGCG
GGTGAGCCGT CATCGGTTGC CTGGGGCCAG CCCTTTGCCC ACATGGACGG TACGCCGGTC
TGGCTGGAAA TGGAGGCGCG CCGCATCCGC CTGAACGGGC GACCGGCCAT CCTCGGCGTC
TGTCATGACC GCACTGACTG CAAACTGATC GGCGAAGCCA TGCACATCAG TCAGGAGACA
TTCCGGCTGG TCCTTGACGC CATGCATGAC AGCGTGTACG TAGTGGGGCG CGACTACAAC
GTTATTTACG CCAACCGGGC CATGCGCGCC GGTGCCTACG GCGATGTGAG CTCTGATCCC
TGCTACCGCA TCTGCCGCGG CTGCAGTGAG CCATGCCCCG ACTGCACCCT GGAAGAAGTG
CTCTCCTCCG ACAAGCCCGT GTGCCGTGAA TTCTTTGATG AAAAGCGCAA CGCCTGGTAT
TCGACCATTG AACTTGCGGT CCGGATTCCC GGCATGAGTG CCCCCGCCAA GCTGGGCGTA
CGGCGGGACA TTACAGCCCG CAAGGAGGCG GAACAGCGCA GCCGGATGCT TTCCCAGCGA
CTCCTGCACG CCCAGGAGAG CGAGCGCAAG CGCCTTTCCC GGGAACTTCA CGACGATCTG
GGGCAGCTCC TCAATGCGCT GAAGATCGGA TTCGATACGC TGGCCGAAGA TCTGCGCCAG
CCGTCGCACG ATGTGCAGGA TCGCCTCTGG TATCTGAGCG AATCACTGAA TAATTCCATC
CACACCATTC GCCGCCTCTG CGCCGGTCTT CATCCCTCGA CCCTTGAGCG GCTGGGGCTG
GCCGAGACGA TCCGGCAGCA GTGCACCCAA GCGGCAACTA CCCATTCGCT CGACATCGAG
CTCAAGTGCG GGACCATGCT CGGCGTGCGG CTTGAGTCGG ATACTGAAAT CAACCTGTAC
CGGATCTTTC AGGAGGCCCT GCACAACGTG GTGAAACACG CTGACGCCAC CCGGGTAATG
GTACGGCTGG TGGCATCGCA TCCCACGGTA AGGCTCCAGA TCGAGGATAA CGGCGGCGGC
TTCTCCATGG AAGACTACGC GGAGCTCTCG GGGCACCACC TGGGACTCAT CAGCATGGCG
GAGCGGGTGG AATTGCTGGG TGGCACGTTC CAGATCGCGT CCCTCCCCGG TCTCGGAACG
GCGGTGACCG TGGAGGTCCC GTTCCGCAAG GCCCCGCCGG GCCGAACCAC TGCCGCGAAG
GGGCGGCCGA GCCTCCCCTA CCCGGTGCGC GGGAGGAAAT CACTCTATGT ATGCTAA
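
The gene sequence is listed in space-separated blocks of ten bases, so the spacing has to be removed before any computation. The following is a minimal sketch of how a GC fraction such as the one in the GC content field could be computed from a listing in this layout. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the listing is used as input, so the printed value describes that row and not the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the block spacing and line breaks and return upper-case bases."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Return the fraction of G and C bases in a blocked sequence listing."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence listing above (60 bases).
first_row = "ATGTCGGTTG ATTCGGTGTT ATTCGCCGTC ACCCCCCGCC CGGGGGATAC TCCAAACCTG"

print(len(clean_sequence(first_row)))
print(round(100 * gc_fraction(first_row)), "% GC in the first row")
```

Passing the full 1557-base listing to `gc_fraction` would give the gene-wide value to compare with the GC content field.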
 
Protein sequence
MSVDSVLFAV TPRPGDTPNL RRYPLMLALQ DDSLFAAIVD NLSDGIYVLQ DDRVVYLNER 
FAQLFGHGDA GSLLMRELDD ILPHGEGKEI VGRIHAELLA GEPSSVAWGQ PFAHMDGTPV
WLEMEARRIR LNGRPAILGV CHDRTDCKLI GEAMHISQET FRLVLDAMHD SVYVVGRDYN
VIYANRAMRA GAYGDVSSDP CYRICRGCSE PCPDCTLEEV LSSDKPVCRE FFDEKRNAWY
STIELAVRIP GMSAPAKLGV RRDITARKEA EQRSRMLSQR LLHAQESERK RLSRELHDDL
GQLLNALKIG FDTLAEDLRQ PSHDVQDRLW YLSESLNNSI HTIRRLCAGL HPSTLERLGL
AETIRQQCTQ AATTHSLDIE LKCGTMLGVR LESDTEINLY RIFQEALHNV VKHADATRVM
VRLVASHPTV RLQIEDNGGG FSMEDYAELS GHHLGLISMA ERVELLGGTF QIASLPGLGT
AVTVEVPFRK APPGRTTAAK GRPSLPYPVR GRKSLYVC
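
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The following is a minimal sketch of that translation, not the database's own method. Table 11 assigns the same amino acids to codons as the standard genetic code and differs in which codons may act as start codons. This gene begins with ATG, so the sketch uses the standard assignments without any start-codon handling. The `translate` function and the table-building code are illustrative.

```python
# Standard codon assignments, listed with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a blocked DNA listing in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listing (30 bases, 10 codons).
first_blocks = "ATGTCGGTTG ATTCGGTGTT ATTCGCCGTC"
print(translate(first_blocks))  # MSVDSVLFAV
```

The output matches the first block of the protein sequence listing. A library such as Biopython could do the same job with full table 11 support if start-codon handling is needed.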