Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1666 |
Symbol | |
ID | 4069814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2015940 |
End bp | 2017232 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637983674 |
Product | putative transcriptional regulator |
Protein accession | YP_590741 |
Protein GI | 94968693 |
COG category | [K] Transcription |
COG ID | [COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTCGA CATTGGCGCA ATCACTCCAG AGTGTTCTAG ATGAATTGTC AGTAGGAATC ACTGCCAATG GTTATCGCTC TCTAAACCTA CGTGTTCTCC TAGAGAATCC GAACAAGTTC GTAACGGGAT CGCTAACCTT CAGCGAAAAC AGCTTTCGCG TATCCGATCC GGAGATACAT AAGTACAACG ATGCGACGCT CATCTCAGTG GTTTCTGACG TCGAGATGTG GCGCGATTTC TTGGTGCGGA TGCTCACGGA TAACTTCTCG TTATTTGGTA CAGAGATCCC ACTATCGTTG GCGTTCAGCC AGTCACAACC GGATACCGAT CTCAATTCGG ACATTTCCTC TCCGCGCTTC GCGTATCACT TCGATCTTAA TAATCGCTAC TACCATCAGA CCCGAGGAAC ACTCATGGCG ATAGGGCGAC CACCGTATGC CAGTATCGAG GATGCGTCAG CGCGATTCGT GCATCAATTG AAATCACGAA CAGGTGGGAT TCTCCACGAA GGCAGGCTGG TGGTGAGCCT TCCGACGAAA CCGCTCATCG CTTCCGCCGA ATGGATTCCT GGCGAGCTCC GAATAAGATT TCATGAAGGG CTGCAGCCGC GTTATCAATT GGATATTTTG TTTTGGACGG GCGCCTCCGT CATCGAATCT CAATCACATT CCTCGCTCGG GCGAGACTTT ACGTCCGCCG TTCCGAAACG AACCACGAGA ATTACGGCAT ACCTAATCCG CGATGATGGG AATGTCGCGC ATAGCTTCGA ATTGCGGTCT CCGTATACTT TCGTCGGGGA AAAAACAGCA ACACTCAGCT TGGAACAGCT GGTAAGAGCC GACATCGCCG CCGGCGAAAG CGAGATACGG GAGATGAAGG CGTTTTTTAA CCCTGACGGA AACACCGAAA TGAAGACGAG AGTGCTTCAT ACGACCATTG CGTTCGCTAA TACGCAGGGA GGAACGATCT ACATCGGCGT TGAAGATAAC GGCGAATTTT CAGGAACTCC GAAACTGCGT GGCGTGAGTT CCAAACGCGT TCCCTCAGAG TGTGCGAAGG ACATCTCCAA GAAACTGAGA AAACTCATCA TCGAGAACAC ACGACCGGTT GTGGAAGTGC AAGCCAACGA AATCTGCCTT TCAGGGGATT GGGTGATTGC GCTTTCGGTC GGGAAGTCCA CCGAGGTGGT GAGCACTCAT CGAAATGTAG TGGTGATTCG AGCGGGTGCC AGTAACCGAC TCCCTGACCC AAAATGGTTC GAACAGCGCA AAGCCGACAA CTCTTGGTTC TAA
|
Protein sequence | MASTLAQSLQ SVLDELSVGI TANGYRSLNL RVLLENPNKF VTGSLTFSEN SFRVSDPEIH KYNDATLISV VSDVEMWRDF LVRMLTDNFS LFGTEIPLSL AFSQSQPDTD LNSDISSPRF AYHFDLNNRY YHQTRGTLMA IGRPPYASIE DASARFVHQL KSRTGGILHE GRLVVSLPTK PLIASAEWIP GELRIRFHEG LQPRYQLDIL FWTGASVIES QSHSSLGRDF TSAVPKRTTR ITAYLIRDDG NVAHSFELRS PYTFVGEKTA TLSLEQLVRA DIAAGESEIR EMKAFFNPDG NTEMKTRVLH TTIAFANTQG GTIYIGVEDN GEFSGTPKLR GVSSKRVPSE CAKDISKKLR KLIIENTRPV VEVQANEICL SGDWVIALSV GKSTEVVSTH RNVVVIRAGA SNRLPDPKWF EQRKADNSWF
|
| |