Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_3662 |
Symbol | |
ID | 5166604 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | + |
Start bp | 4289855 |
End bp | 4291519 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640551146 |
Product | putative transcriptional regulator |
Protein accession | YP_001232388 |
Protein GI | 148265682 |
COG category | [K] Transcription |
COG ID | [COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACAAC AAGCCTTACA TATACTTTTG TCTGAGTTAA TCTCCCGTTG GGAAAATGAA GTAATCGAGT TCAAAAATGT CGGTGACTCA TATTCGACAT CTGACATCGG CAAATATTTT TCGGCTCTTG CCAATGAAGC CAACCTGCGG GATGTTGAAA AGGCTTGGCT GGTTTTTGGT GTGAATAATA AAAGCAGATC CATCGTCGGC AGCGATTACC GGCAGGACAA TGAGCGCCTA CAAAGCCTGA AAATGCAGAT TTCAGCGGAG ACTGAGCCCA GCATTACGTT CCGTGAAATT CATGAGTTGC AAAGTGATAA GGGACGCGTA ATCCTGTTTG AAATACCTGC TGCCCCTTTG GGGATGCCGA TTGCCTGGAA AGGGCATTAT TACGCCCGCG CAGGCGAAAG TCTGACTCAT CTTGGCCTGG ATAAGCTGGA TGAAATTCGT AAACAGGTAG GGGCAACTGA CTGGTCTGGT CAGGTGGCTA ACGCAGCTAC ACTGGATCAT CTGGACACAC ACGCACTGAC TAAGGCTCGT GAATCTTTCG CCAAAAAGTA CGCTAATCGT TTTGCCCTGG AAGAGGTTAT GACATGGCCT GAAAGTACAT TTTTGGACCG TGCAAAACTA ACAAAAGATG GCCAAATTAC CCGCGCTACC ATTTTATTAC TGGGAAAAGC TGAAGCAGCC CATTTGCTCT CACCGCAACC AGCACAAATG ACCTGGAAAC TGGAGGGGGT TGAGCGTGCT TACCAACACT TTGGCCCCCC ATTTCTGCTG AATACCAGCA TTCTGTATCA GAAAATCCGC AATATTCAGT TGCGTATCCT GCCGGAAGAT CAATTGCTCC CCATTGAAGT GGCCAAATAT GACCAGAAGA TTGTGCTTGA GGCATTGCAC AACTGCATTG CTCATCAGGA CTACGGCCGC AATGGGCGCA TCATTGTCAC AGAACTACCT GACAGGCTGA TTTTTGAAAA CGAGGGCTCG TTTTATGAAG GGCAACCAGT TGATTATATT TCTGGGCACA AAACCCCCAG ACGCTACCGT AACCCTTTTC TGGCCCAGGC GATGGCTGAG CTTAATATGA TCGACACCAT GGGGTATGGC ATTTATGAAA TGCACATTGG ACAGGCACGG CGCTATTTCC CCCTGCCTGA TTATGATTTG AGCGAGCCCT ACGCGGTCAA GATGACAATC CATGGCAAGA TTGTTGATCC TGCCTACAGT CGTATGCTTA TCCAGAAAAC TGATTTGTCA TTACAAGACA TATTTGCCCT TGATCGTGTC CAGAAAAAAC TCCCACTTGA TGACGTAATG GTGAAACATT TGCGCCGAGC AAATCTTATT GAAGGCAGAA AACCCAACCT CCACGTCTCC GCTACCGTTG CCGCTGCGAC AGCGAGCAAG GCGGACTATA TTCGTACCCG TGCCCAGGAT GACGATTATT ACACCAAGCT GATACGTGAT TATCTGGTTA AGTTTGGTTC TGCAACCCGC AAGGAGATTG ATACTCTGTT GTGGGGCAAA TTGAGTGATG CTTTGGATGT CGAACAAAAA CAGAACAAAA TCGGCAACTT GATTACCAGT ATGCGAACTA CAGAAACAAT AGTGAATACG GGATCTCGTA AATTACCCAA ATGGACTCTC AAAGAAAAAA AATAA
|
Protein sequence | MAQQALHILL SELISRWENE VIEFKNVGDS YSTSDIGKYF SALANEANLR DVEKAWLVFG VNNKSRSIVG SDYRQDNERL QSLKMQISAE TEPSITFREI HELQSDKGRV ILFEIPAAPL GMPIAWKGHY YARAGESLTH LGLDKLDEIR KQVGATDWSG QVANAATLDH LDTHALTKAR ESFAKKYANR FALEEVMTWP ESTFLDRAKL TKDGQITRAT ILLLGKAEAA HLLSPQPAQM TWKLEGVERA YQHFGPPFLL NTSILYQKIR NIQLRILPED QLLPIEVAKY DQKIVLEALH NCIAHQDYGR NGRIIVTELP DRLIFENEGS FYEGQPVDYI SGHKTPRRYR NPFLAQAMAE LNMIDTMGYG IYEMHIGQAR RYFPLPDYDL SEPYAVKMTI HGKIVDPAYS RMLIQKTDLS LQDIFALDRV QKKLPLDDVM VKHLRRANLI EGRKPNLHVS ATVAAATASK ADYIRTRAQD DDYYTKLIRD YLVKFGSATR KEIDTLLWGK LSDALDVEQK QNKIGNLITS MRTTETIVNT GSRKLPKWTL KEKK
|
| |