Gene GSU3363 details


Gene Information

Locus tag: GSU3363
Symbol:
ID: 2688233
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 3694820
End bp: 3697111
Gene Length: 2292 bp
Protein Length: 763 aa
Translation table: 11
GC content: 59%
IMG OID: 637128057
Product: sigma-54 dependent transcriptional regulator
Protein accession: NP_954403
Protein GI: 39998452
COG category: [K] Transcription
              [R] General function prediction only
              [T] Signal transduction mechanisms
COG ID: [COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains
        [COG4624] Iron only hydrogenase large subunit, C-terminal domain
TIGRFAM ID:
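
The Gene Length and Protein Length fields can be cross-checked against the Start bp and End bp coordinates. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions about this record's conventions, and the function names are illustrative only.

```python
# Assumption: Start bp and End bp are 1-based, inclusive coordinates.
def gene_length(start_bp: int, end_bp: int) -> int:
    """Number of bases spanned by the gene, counting both end points."""
    return end_bp - start_bp + 1


# Assumption: the CDS consists of whole codons and ends in one stop codon,
# which does not contribute a residue to the protein.
def protein_length(gene_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(3694820, 3697111)
print(length_bp)                  # 2292
print(protein_length(length_bp))  # 763
```

Under these assumptions the coordinates, the 2292 bp gene length, and the 763 aa protein length are mutually consistent.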


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGCCGA TAATTACCGA CAAGGAAAAG TGCCGCAAGT GCTACTGCTG CGTCAGGAGC 
TGCCCGGTCA AGGCAATCAA GGTTGAAAAA CGCTACACCG AAATTATCTT TGACCGCTGC
ATCGGCTGCG GCAACTGTCT GAGCAATTGC CCCCAACGGG CAAAGATGGT GGCCGACAAA
GTGGGTGTTA CCGAGAGTCT TCTCACCTCG GAAGACAAGG TAATCGCCGT GCTCGGATCA
TCATTTCCAG CCTTTTTTCA CAATGTTGCA CCGGGCCAGC TTGTGGCCGG CCTCAAGCGG
ATCGGCTTCC GTGAAGTCCA CGAAGGGGCC TACGGTGCTG AACTGATCGC TGACGATTAC
GCCCGTATTA CCGGCGATAA TGACCGGACC TACATTTCAT CCCACTGCCC GGCCATCGTT
GACCTGATCG AGCGACACTA CCCGAAACTG CTCAGAAACC TGGTGCCCGT CGTTTCTCCC
ATGATCGCCA TGGGCCGCTA TCTCAAGGAA GTTCTGGGCC CCAAGACCAA GGTGGTCTAC
ATCAGCTCCT GCATTGCCGC AAAGTTCGAA ACCCAGATGA CCGAAACGCG GGGAGCCATC
GACATTGTTC TCACCTATCG GGAACTGGAG GGCATCTTCC GCAGCCGGCA GATTTCCCTC
CCGGCCCTCG ATGCGCTTCC CTTCGACGGG CTTGCGCCCA CGGTGGGACG ACTTTTCCCC
ATCAACGAGG GAACCTTCCG GTCGTTCTCC ATGTCGGCGG ACCCCCTTGA CACTGAAATA
GTCTCCGCCT GCGGCGAAGT AAACGTCATG GGCATCATCA GAGACTTGGC TGCAGGCAGG
ATCGCCCCCC GCTTTGCCGA TCTGCGCTTC TGCTATGACG GGTGCATCGG CGGTCCCGGC
CGCAATAGCG AACTGACCGA ATTCTACCGC CGCAACCTGA TCATTAACCA TTACAAGCAG
GACATCCCCT ACGAGACGGC TCCGCACTAT CTCTCCTCCC GTGAGCAGAC CGGCCTGGAC
CGCTCTTTTG CCAGCAAGCA CGCGCGCCTC GAGTCGCCAA AAACAAATGA TATCAAGAAA
ATACTGCAAG CCACCAGCAA GTACTCGGTC AAGGACGAAC TCAATTGCCG CGCCTGCGGC
TATCGCACCT GCCGCGAGTA TGCCGTGGCG GTCTACCAGG GCTTGGCCGA GATCGAGATG
TGCCTTCCCT ACAACCTCCA GCAACTGGAA GAGGATCGTG GTCGCCTGAT CCAGAAGTAC
GAGCTCGCCC GTCGAGAACT CGACCGCGAG TATACTGACG AGTTCATCGT CGGCAACGAC
CGCAAGACCT TGGAAGTACT TGATCTCATC AAGCAAGTGG GGCCGACGCC GACGACGGTT
CTGATCCGGG GCGAGTCGGG AACCGGCAAG GAACTCACCG CCCGAGCCAT TCATCGCTTC
AGCAAGCGCA ACGATAAGCC CCTGGTGACC GTGAACTGCA CCACTATCAC CGACTCCCTT
CTGGAGAGCG AACTCTTCGG ACACAAGCGG GGGGCCTTCA CCGGTGCCAT CGCCGAGAAA
AAGGGGCTGT TCGAGGCCGC CGACGGCGGC ACCATTTTCC TGGATGAAAT CGGCGATATC
ACGCCGAAAT TGCAGGCAGA GCTCCTGCGT GTCCTCGACA TGGGCGAGGT GCGTCCCGTT
GGGGGTACGA CCGCCCGCAA GGTCGACGTA CGCCTCATTG CCGCAACCAA CCGGAACCTC
GAAGAAGGAG TACGGGAGGG CTGGTTCCGC GAGGATCTGT ATTATCGCCT CAACGTTTTC
ACCATTACCA TGCCGCCTCT GCGCAACCGG GTGGAGTCGA TTCCGATCCT GGCGCACCAC
TTCATGGAAA AAGCCAGCAC CAAACTGAAC AAACGGCTCT CAGCCATCGA AGAGCGGGCC
GTCATCGCCC TCACCAAGTA CCCTTGGCCC GGCAATATCC GCGAGATGCA GAACGTCATC
GAACGGGCTG CGGTTCTTGC CCACGACGAT GTTATCCACC TGGAGAACCT CCCCCTGGCC
CTTTCGGAAA ATCTGGCAGG AAGCCCGACC GCCGACCTGG ATATCCGGGC TTCCTTCCGC
GCGGAGCGCG AAAGGCATGT GGTCAAGCTC GAGAAGAAGC TGATCCAGCG CTATCTGGCC
GAGGCAAACG GCAATGTGAG CCGTGCCGCG CGGCTCGCCA ACATTCCCCG CCGAACCTTT
TACCGCCTGC TCGACAAGTA TCGCCTCAAA GACAAAGAGG TGCGGGATAT GCCGCAGGGG
GACAGACAAT AG
 
Protein sequence
MEPIITDKEK CRKCYCCVRS CPVKAIKVEK RYTEIIFDRC IGCGNCLSNC PQRAKMVADK 
VGVTESLLTS EDKVIAVLGS SFPAFFHNVA PGQLVAGLKR IGFREVHEGA YGAELIADDY
ARITGDNDRT YISSHCPAIV DLIERHYPKL LRNLVPVVSP MIAMGRYLKE VLGPKTKVVY
ISSCIAAKFE TQMTETRGAI DIVLTYRELE GIFRSRQISL PALDALPFDG LAPTVGRLFP
INEGTFRSFS MSADPLDTEI VSACGEVNVM GIIRDLAAGR IAPRFADLRF CYDGCIGGPG
RNSELTEFYR RNLIINHYKQ DIPYETAPHY LSSREQTGLD RSFASKHARL ESPKTNDIKK
ILQATSKYSV KDELNCRACG YRTCREYAVA VYQGLAEIEM CLPYNLQQLE EDRGRLIQKY
ELARRELDRE YTDEFIVGND RKTLEVLDLI KQVGPTPTTV LIRGESGTGK ELTARAIHRF
SKRNDKPLVT VNCTTITDSL LESELFGHKR GAFTGAIAEK KGLFEAADGG TIFLDEIGDI
TPKLQAELLR VLDMGEVRPV GGTTARKVDV RLIAATNRNL EEGVREGWFR EDLYYRLNVF
TITMPPLRNR VESIPILAHH FMEKASTKLN KRLSAIEERA VIALTKYPWP GNIREMQNVI
ERAAVLAHDD VIHLENLPLA LSENLAGSPT ADLDIRASFR AERERHVVKL EKKLIQRYLA
EANGNVSRAA RLANIPRRTF YRLLDKYRLK DKEVRDMPQG DRQ
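
As a rough way to relate the two sequence blocks above, the following sketch translates a DNA string codon by codon and computes its GC percentage. It assumes the amino acid assignments of translation table 11 match the standard genetic code (the two tables differ in which start codons are permitted, not in the assignments), and it strips the spaces and line breaks used in the listing. The helper names `clean`, `translate`, and `gc_percent` are hypothetical, not part of any database tool.

```python
# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Note: an alternative start codon (e.g. GTG or TTG) would be read here
    as its ordinary amino acid, not as M. This gene starts with ATG.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 12 bases of the gene sequence listed above.
print(translate("ATGGAGCCGA TAATTACCGA CAAGGAAAAG"))  # MEPIITDKEK
print(translate("GACAGACAAT AG"))                     # DRQ
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the listed protein sequence and GC content.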