Gene GSU2401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU2401 
Symbol 
ID2686139 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2634831 
End bp2637878 
Gene Length3048 bp 
Protein Length1015 aa 
Translation table11 
GC content60% 
IMG OID637127091 
Productsensory box histidine kinase/response regulator 
Protein accessionNP_953447 
Protein GI39997496 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCTCGG CCCCGTATGT TCGAGGAGGT ACCGTGGACT ATACCCTGAA GGATCTTCTG 
GACATTCCCG GACTGCAGGC GCTGCTCGAT TCTCTGCGGG CGCTTCATCA GCTTGCCTCC
TCCGTAATCG ACCCTGAAGG TACCATTCTC GCCGCTTCGG GCTGGCAGCG GCTCTGCACG
GAATATCACC GGGCACACCC ACCCTCCCGG CAAAAGTGCA TCGAAAACGA ACTCCGCGTT
GGCTCCCAGG CCGGCGAGTC ACTCATGCCT GTCATCCACC GCTGCCCCAT GGGACTTGAG
TATGCGGTCA CGCCAATCGT TATCGAGGGA GAGCACCTGG CGAATATCTT TGTCGGCCAG
ATTTTCACCT CCTCTCCGGA CGAGTCGTAT TTTGTCCGCC AGGCGCGGCA GTTTGGATTC
GACGAGGACG CCTACCTGGC GGAAATGCGG AATGTGCCGA TCTTTGACGA GGAAACGCTT
CACTCGTATC TCGCCCTTTT CCGCAGCATT GCCAACATGC TTGCGGAGCA GGGCCTGCAC
GTACTGCGAC AGCGCGTGGC CAACGAGGAG CTCCGGAAGA GCGAAAAACG CCACCAGACC
ATTCTCCAGG CCGCCCTCGG CGGGCTTTGC ACCATCGACA TGCAGGGCCG GCTGCTCGAA
GTCAACGAGG CCTATTGCCA GAAGATCGGC TACAGCCGGC AAGAACTGCT GACCATGAGC
ATCTTCGACC TGGAGGCAGC CGAAACGCCC ACCATGACAT CCGCGCACAT GCGGAAGATC
ATGGAAGACG GGGAAGGACA TTTCGAATCG CGCCATCGGT GCAAGGACGG AAGCGTTATC
GACGTGGAAG TCAACGTCAG GTACTGCCCC GAAGATGGCG GCCAGTTTAT CGCCTTCCTG
CGCAACATTA CGGAGCGCAA ACGTGCTGAA GCGCTCCATA GGATGGGGCA GGATATCCTG
CTCGCGCTCA ACGAGAATGA AGATATGAAG GAGGCGATCC AGCGGGTTCT GGGTCTGCTG
AGATCGGCTA CCGGGGTCGA TGCCGTTGGG ATTCGCCTGC AGGACGGAGA CGATTTCCCC
TATTTCTACC AGGAAGGGTT CCCTCCGGAC TTTCTGCAGA AAGAGGACTC CATCGTAGCA
AGGAATAAGG ATGGCGGGGT GTGCCGGGAC TCATGCGGCA ACGTCTGCCT GGAATGCACC
TGCGGCATGG TGATTTCCGG CAGTACGGAC CACGCCAACC CGCTCTTCAC CCCGGGGGGG
AGCGCCTGGA CCAATGATTC GTTTCCCTAC CTCGAAGTGC CGGCCGACCA GGACCCCAGG
ACAACTCCGC GCGACGAATG CATCCACCAG GGGTACGCAT CGATCGCCCT GATCCCCATC
CGGGCAAAGG GACGAATCGT CGGGCTGCTG CAGCTGAACG ACCGCCGCAA GGGGCGCTTT
ACTCGCGAGG GGATCGAAGC CCTCGAAGAC ATCGCCAAGA ACATCGGCGA GGCAATGCTG
CGCAAGCAGG CCGAGGAGAA ACTTGTTGCC AGCGAGCGCT TCCTCCGCAT GCTCACCAAT
CAGCTGCCGG GGATGGTCGG CTACTGGGAC CGGGATCTCC GCTGTCGCTT CGCGAACGAT
GCCTACAAGG AATGGTTCGG CAGGTCGCCG GAGCAGATCA TCGGCCTCAC CGTCCAGGAA
TTGATGGGGG AAGAGCTGTT CCGCCTGAGC GAGCCGTACA TCCGGGGCGC CTTGCAGGGG
GAACCCCAGA ATTTCGAGCG GGAGCTGGTG AAGCCGAACG GCGAAACGGG CTACACCTGG
GCCCAGTACA TTCCCGACAT GGTGAACGAC AAAGCGATCG GCTTCTTCGT GCTTATCTCG
GATGTGACCG AACTCAAACG AGCCGAGCAG GAAAAGGCTG CTCTCGAAGT CCAGCTCATG
CAGGCCCAGA AAATGGAATC CGTGGGACGT CTGGCCGGGG GTGTGGCACA CGACTTCAAC
AACATGCTGA GCGTCATCCT GGGGCATACT GAAATGGCAC TCCTGCGCAT GGACCCCAAC
CAACCGACCT ATGCCTCGCT CCGCGAAATC GACAAAGCAG CCCAACGTTC TGCCGACCTT
ACCCGGCAAC TCCTGGGCTT TGCGCGGCAG CAAACCGTTT CCCCCAAAGT GCTGAACCTG
AATGAAAGCA TTTCGGGGCT GCTCACTATG CTGCACAGGC TGATCGGCGA GAATATCCGC
CTTCAGTGGC AGCCGGCGGC GAACCTGTGG CAGGTTCGCA TGGATTCGTC GCAGATCGAC
CAGATCATGG CAAACCTGTG CGTTAACGCC CATGATGCCA TTGCCGACGT GGGCGAGATC
ATCATTGAAA CAGGGAACTG CACCTTCGAC GAGAGGGACG GTACGATTCA TCCCCATGGG
GCAGTCGGTG ATTACGTGAG GATCCGCGTG AGCGACAATG GTTCGGGGAT GGACAAGGAG
ACGCGTGCCC ACATCTTCGA ACCGTTCTTC ACCACCAAGG AAGTGGGAAA GGGAACCGGC
CTGGGATTGG CGACCATCTA CGGCATCGTC AAGCAAAACA ACGGACTGAT CGATGTTGCC
AGCGAGCCGG ACAAGGGAAC GACCTTCTCC ATCTATCTCC CCCGATTCAA AGGAAGCGAG
GAGCCGGCGA CGCCGAGAAC GGCAACGTCC TCCTTCAACC GGGGCAATGA AACGGTCCTG
CTGGTGGAGG ATGAACCGGC TATCCTGGCG ATGACAACGG AGATTCTGAC CCAACTGGGC
TACGCGGTCC TGCAGGCGCC CACCCCCACC GAGGCGTTGC GCATCGGCTG CGAGCACAAG
GGGGAGATCC ATTTATTGAT GACGGACGTC ATCATGCCCG AAATGAACGG ACGGGAGCTG
GCCAAGAGCC TCTCACAGTT CTATCCCCGG ATCAGGTGCC TGTACATGTC GGGGTACACC
TCGGACATCA TTTCCCCCCA CGGCGTGCTC AACGACAGCG TCCACTTCCT CCAGAAACCG
TTCGACTTTA CTACCCTTTC GCTAAAGCTG CGCGAGGTGC TCGACTGA
 
Protein sequence
MGSAPYVRGG TVDYTLKDLL DIPGLQALLD SLRALHQLAS SVIDPEGTIL AASGWQRLCT 
EYHRAHPPSR QKCIENELRV GSQAGESLMP VIHRCPMGLE YAVTPIVIEG EHLANIFVGQ
IFTSSPDESY FVRQARQFGF DEDAYLAEMR NVPIFDEETL HSYLALFRSI ANMLAEQGLH
VLRQRVANEE LRKSEKRHQT ILQAALGGLC TIDMQGRLLE VNEAYCQKIG YSRQELLTMS
IFDLEAAETP TMTSAHMRKI MEDGEGHFES RHRCKDGSVI DVEVNVRYCP EDGGQFIAFL
RNITERKRAE ALHRMGQDIL LALNENEDMK EAIQRVLGLL RSATGVDAVG IRLQDGDDFP
YFYQEGFPPD FLQKEDSIVA RNKDGGVCRD SCGNVCLECT CGMVISGSTD HANPLFTPGG
SAWTNDSFPY LEVPADQDPR TTPRDECIHQ GYASIALIPI RAKGRIVGLL QLNDRRKGRF
TREGIEALED IAKNIGEAML RKQAEEKLVA SERFLRMLTN QLPGMVGYWD RDLRCRFAND
AYKEWFGRSP EQIIGLTVQE LMGEELFRLS EPYIRGALQG EPQNFERELV KPNGETGYTW
AQYIPDMVND KAIGFFVLIS DVTELKRAEQ EKAALEVQLM QAQKMESVGR LAGGVAHDFN
NMLSVILGHT EMALLRMDPN QPTYASLREI DKAAQRSADL TRQLLGFARQ QTVSPKVLNL
NESISGLLTM LHRLIGENIR LQWQPAANLW QVRMDSSQID QIMANLCVNA HDAIADVGEI
IIETGNCTFD ERDGTIHPHG AVGDYVRIRV SDNGSGMDKE TRAHIFEPFF TTKEVGKGTG
LGLATIYGIV KQNNGLIDVA SEPDKGTTFS IYLPRFKGSE EPATPRTATS SFNRGNETVL
LVEDEPAILA MTTEILTQLG YAVLQAPTPT EALRIGCEHK GEIHLLMTDV IMPEMNGREL
AKSLSQFYPR IRCLYMSGYT SDIISPHGVL NDSVHFLQKP FDFTTLSLKL REVLD