Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3892 |
Symbol | |
ID | 4072227 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4604301 |
End bp | 4605779 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637985916 |
Product | GntR family transcriptional regulator |
Protein accession | YP_592966 |
Protein GI | 94970918 |
COG category | [E] Amino acid transport and metabolism [K] Transcription |
COG ID | [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.179019 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.485504 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCAGTC GTCCTTCATC GGCAGTGCCA ATGATCGGAG TCGATCGTCA TGCCGCACTG CCGCTGCATC GGCAGTTGTA TGAGAGCTAT CGCTCGGCCG TCCTCGGCGG CCGATTGCGG CCGGGGCAGA TGGTGCCCTC CACCCGCGCG TTGGCAATGG AGTTGGAAAT CTCCCGAATG CCGGTGTTGA CGGCATACGC ACAACTGCTC GCGGAGGGAT ATTTCGAAAC GCGCTCCGGG ATCGGAACCG TCATCAGCGA GGCACTGCCC CAGCAACGTT CGGTGGCGGG CAAAGCGCAG ATTCCGCCTC CAATCGAGCC CGGTCCGCGG AATGTCTCGT CGCGATGTTT GACGCCCTAC CGGCCCTTCC TGCCTCCCTG GATCGCGGGG AGAGGCGCGT TCAACGTGGG AGGCGTGGCG CTGGAACACT TCCCGCATCG TACCTGGACG CGACTGGTCG CGCGCAGCGC GCGCAACGGC GCGGGCATTG GTCCGGCGCG CGACGTGATG GGGTCGATCG AGTTACGCGA AGCGGTTGCC GATTATCTGC GGACGTCTCG GGCTGTGAAT TGCACGGCCG ATCAGATCAT GATCACGAAC GGCTCACAGA ACGGCGTGGA ACTCACAGTG CGCGCGCTGC TCGATCCCGG TAGCGCGGTC TGGATGGAAG AGCCTGGTTA CCAACTCGCA CGCGACGTGC TGAGCATGGC GGGATGCCGC ATCGTGCCGG TGCCAGTGGA CCAGCAAGGC ATCGATGTGG TGCGCGGAAT CAAGATGTGT CGCAATGCTC GCGCTGTGAT CGTTACCCCG TCACACCAAT TCCCTCTTGG GATGACGATG AGCGTTTCGC GCCGGTTGGA GTTGCTGGCT TGGGCGTCGA AGCAAGGTTC ATGGATCATC GAGGACGATT ACGACGGCGA ATTTCGGTAT GAGAGCAAGC AGATCGCGTC GTTGCAAGGC CTCGATCGCG ACGGCCGCGT GATTTATATC GGTACGTTCA GCAAAGTGTT GTCATCTGCA CTGCGCATTG GCTATGTGGT GATTCCGCGC GACCTGGTGC CGCACTTCAT GCGAGTTCGG TGGTCTACCG ATCTTGGCTC AGAAGAGCTG ATCCAGACGG TCGTCAACGA CTTCCTGCGT GAAGGCCATT TTGCGCGACA CCTGCGGAAC ATGCGCCTAA CCTATCGCGA ACGGCGGAGC GTGTTGGTGG AGTCTCTCGA GAAGACGCTC GGAGGCGAGG CTGAAATCGT GGGAGCAGAG GCGGGGTTAC ACCTGGTCGT GCTTCCAAAA GGCCTGAAGG ACGACGTGGA GATTTGCAAG GCAGCGGCAC GGGAGCCACT GTGGCTGTGG CCGCTTTCAC ACTGCTATCA CGGTCCAGCC GCAAAGCACG GGTTTGTCCT CGGCTTCGGC GGGACACCGC CAGAGAGAAT TCCGAGGGCG GTGAAGCAGA TCCGCGAGGT GTTGCGCGGA CGGAGATAA
|
Protein sequence | MSSRPSSAVP MIGVDRHAAL PLHRQLYESY RSAVLGGRLR PGQMVPSTRA LAMELEISRM PVLTAYAQLL AEGYFETRSG IGTVISEALP QQRSVAGKAQ IPPPIEPGPR NVSSRCLTPY RPFLPPWIAG RGAFNVGGVA LEHFPHRTWT RLVARSARNG AGIGPARDVM GSIELREAVA DYLRTSRAVN CTADQIMITN GSQNGVELTV RALLDPGSAV WMEEPGYQLA RDVLSMAGCR IVPVPVDQQG IDVVRGIKMC RNARAVIVTP SHQFPLGMTM SVSRRLELLA WASKQGSWII EDDYDGEFRY ESKQIASLQG LDRDGRVIYI GTFSKVLSSA LRIGYVVIPR DLVPHFMRVR WSTDLGSEEL IQTVVNDFLR EGHFARHLRN MRLTYRERRS VLVESLEKTL GGEAEIVGAE AGLHLVVLPK GLKDDVEICK AAAREPLWLW PLSHCYHGPA AKHGFVLGFG GTPPERIPRA VKQIREVLRG RR
|
| |