Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU2475 |
Symbol | |
ID | 2687863 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 2712307 |
End bp | 2715018 |
Gene Length | 2712 bp |
Protein Length | 903 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637127165 |
Product | sigma-54 dependent transcriptional regulator |
Protein accession | NP_953521 |
Protein GI | 39997570 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.198348 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGTCTCG TCTCCGGCCA GGACCGGATC CGCATCTACC TGGAAGACCT GACCCGCGGC GTCCTTACCT GTGTCCACGC CTCGGGCCCC CTGGCCGATG AGATCCGCGA GGTGAGCTTC CCCATCATCT CCCGTGAGGC GTCGGTTTCC AGCGTCTTTG TCTCCCAGTA CGCCACCGAA TTCCACTACG AGCCGGAGGG AAAACACACC TTCGACCGGG GCTTTGCCGA ACGCTTCGCC ATCGGCCACA GCTACATCCT GCCGGTGGTG AGCCAGGGGA AATCCATCGG CGTGGTCTGC GTCGACCGCT TCCGGCCGGG GGAGATCCTG CGCGGCAAGG GGAAGGCGCT GCTGGGCGAG TTCGTCACCT CCGTGGCCGA CCGGCTCGAC GTCGCCCGCA TCTACCACCA GCAGCTCCTG CTCGCCCGCC GGGTCGAGGA GTACAAGAAA CGGGAAGCTG CCTCGTTCAT GGTGCAGTCG GCCGTGCGCC TCATCGACCG GCTGGTCCTC GCCTCGGTGC TGGTGCCCGT GCCCGGTCCG GAAGGCTCGT CGCGCCTCGC CATCCTGGCC AGCCACTCGG AAGACCCCAG CCTGAAAAAG CAGTATGACG AGCAGGGAGA GATCGCTCTC CAGAGGGGTA CCTCCCTCAT CTCCCGGTTC CTGGACGACA ACGCGGTCAT CGCCGACGAA CGGCTTCTGC GCCCGCTGTT CATTCCGGAC CTGACCCAGC AGGAGCTCCA GAAAAAGGCC CTCACCGAGA AGATGGCGCT GCGCTCCCTC TACGTGGTGC CCCGCTACGA GCCGTCCAGC CGCAAGGTCA TCTGCCTGGT CAACTATTTC ACCAAAGACC TGTACCGCTT CTCCGACTTC GAGATGGGCC TGCTCCAGAC CCATGCGGAG ATGGCGGAGC GGATGGTGAA CGAGATCGGC GGCGAACACT TGGAAATCCG CGTCCTGGCC GAGATCACCG AACTCCTCCA GGAGCGCAAC GAAGAGCTTT CCCCGTTCCT CACCCGGGTC CTGTCAATGG CCACGGAGCT GATCGGCGCC GATACCGGCA GCATCGCCAT CGTCCAGGAG CGTGACGGCG AAAAATGGCT TGTGGTGGAG GACGAAGAAG GGACCATCGT CGGGGCCAAG AACAAGTCGT GGCTCAAGAA GTACATCCCC CCCTTCAGAA TCGGCGGCCA CGAGCTCCCC GCCGAGGAGC GGAGCCTCAC CGGCTACGTG GCCTGGTCCA AGCAGCCGAA GATCATCGCC CACGTGGCGG ACGAGCAGGG GGGCGAGGGG TTCCACCGCT CCATGCACGA GCTGATCAAG AGCGAGATCG CGGTCCCCAT CGTCTGCGAC GACGAGGTGA TCGCCGTGGT CTGCCTCAAC TCGCTCAAGC CCGCCTGGTT CACGGAAGAG CACAAGCGGA TCCTGCAGAT CATCGACCGG CTCACCTCCC GGCACATCTC CGACGTCCAG CGGATCGAGC GGCTCGAGGG GGAGGTGACC CGGCTCAAGA CTGACGTGGC CTACAAAGAC CCCCAGATCT CCTCCTACCG GCTCGGCAAC ATCATCGGCA ACAGCCGCAA AGCCCAGGAG ATCGTTGATT TCATCAACAC CGTGTCGGTG CCCCTCTTCA ACCGGATCAC CCTCTGGAGC AAGAACGTCC TCCAGGAGGC GACCATCGGC CTCCCCTCCA TCCTCGTCCA GGGGCAGACC GGCGCCGGCA AGGAGTTCTT CTTCAACAAC CTCTACAACA AGCTGAACGA GATGTACCGG GAGAAGCTCA ACCCCGCTGG CCAGCTCCCC GTGAAAAAGA CCAACATCGC GGCCTACAGC GGTGACCTGA CCTACTCGGA ACTCTTCGGC CACAAGAAGG GGGCCTTCAC CGGCGCCTAC AGCGACCGCA AGGGCATCCT GGAGGATGCC GCCGGCGGGA TCGTCTTCCT GGACGAGATC GGCGACGCCG ACCCCAAGAC CCAGGTGCAG CTGCTCCGGT TCCTGGACAA CGGCGGGTTT GTGCGGCTGG GGGAAAACCA GGACCGTTTC AGCCGGGTGC TCCTGGTGGC CGCCACCAAC AAGGATCTGG CGGAAGAGAT CCGCAAGGGG AACTTCCGGG AAGACCTCTA CCACCGGCTG TCGGAGCTGG CGGTGCAGGT GCCGTCCCTG AACGAGCGGC GCGAGGACAT CCCCGACCTG GCCACCCACT TCCTGGGCAA GCTCTACCGC ACCTACCGGG GGGACGAGTC CAAGGACGCC GCCCCCACCC TGGCGGAGGA GGCCAAGCGG CTCCTGATGA ACCACCACTA CCACGGCAAC ATCAGGGAAC TGCGGAGCAT CCTGCTGCGG GCGCTCTTCT TCCGCAAGGG CACGGTGCTC ACCGCCGACG ACGTCCGGCG CGCCCTGGCC GCGGGCATGC GCGAGTTCGC CCCTGCCACC GCCACCCAGG AGCTGAACGA CCGGATGGTG ACGGAAATCC TGGACAAGAT CGCCAATGGC GAAACCTTCT GGGAAGCGGT CTACGAGCCC TACTCCCGCA ACGCCATCCC CCGCGACGCC GTCCGGCTCG TCATCGAGCG GAGCCGCGAC GCGGCGGGCC GGAGCATGCC CCAGGTGGCC CGCTACCTCA AGGCCGTGGG CGACGATGTG GAAGAAAACG ACGAGGAGCG GAAGCGGTTC TTCAAGTTCA AGAATTTCCT CTATAAGACC GTGAAGATCT GA
|
Protein sequence | MRLVSGQDRI RIYLEDLTRG VLTCVHASGP LADEIREVSF PIISREASVS SVFVSQYATE FHYEPEGKHT FDRGFAERFA IGHSYILPVV SQGKSIGVVC VDRFRPGEIL RGKGKALLGE FVTSVADRLD VARIYHQQLL LARRVEEYKK REAASFMVQS AVRLIDRLVL ASVLVPVPGP EGSSRLAILA SHSEDPSLKK QYDEQGEIAL QRGTSLISRF LDDNAVIADE RLLRPLFIPD LTQQELQKKA LTEKMALRSL YVVPRYEPSS RKVICLVNYF TKDLYRFSDF EMGLLQTHAE MAERMVNEIG GEHLEIRVLA EITELLQERN EELSPFLTRV LSMATELIGA DTGSIAIVQE RDGEKWLVVE DEEGTIVGAK NKSWLKKYIP PFRIGGHELP AEERSLTGYV AWSKQPKIIA HVADEQGGEG FHRSMHELIK SEIAVPIVCD DEVIAVVCLN SLKPAWFTEE HKRILQIIDR LTSRHISDVQ RIERLEGEVT RLKTDVAYKD PQISSYRLGN IIGNSRKAQE IVDFINTVSV PLFNRITLWS KNVLQEATIG LPSILVQGQT GAGKEFFFNN LYNKLNEMYR EKLNPAGQLP VKKTNIAAYS GDLTYSELFG HKKGAFTGAY SDRKGILEDA AGGIVFLDEI GDADPKTQVQ LLRFLDNGGF VRLGENQDRF SRVLLVAATN KDLAEEIRKG NFREDLYHRL SELAVQVPSL NERREDIPDL ATHFLGKLYR TYRGDESKDA APTLAEEAKR LLMNHHYHGN IRELRSILLR ALFFRKGTVL TADDVRRALA AGMREFAPAT ATQELNDRMV TEILDKIANG ETFWEAVYEP YSRNAIPRDA VRLVIERSRD AAGRSMPQVA RYLKAVGDDV EENDEERKRF FKFKNFLYKT VKI
|
| |