Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Teth514_0479 |
Symbol | |
ID | 5878085 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | + |
Start bp | 491827 |
End bp | 493965 |
Gene Length | 2139 bp |
Protein Length | 712 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641540816 |
Product | RNA-binding S1 domain-containing protein |
Protein accession | YP_001662125 |
Protein GI | 167039140 |
COG category | [K] Transcription |
COG ID | [COG2183] Transcriptional accessory protein |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00985663 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAGGA TAAAAGATAT ATTAAAGAAA GAATTTGGAT TAAAAGACTT TCAAGTGGAA AATACAATAA AGCTTATTGA TGAAGGGAAT ACAATTCCTT TTATTGCAAG GTACAGGAAA GAAGCAACAG GAAGCTTGTC AGATGAGGTT TTAAGAAATT TTTATGACCG ACTTACATAT TTGAGAAACC TTGAAGAGAA GAAACAGGAT ACTATAAGGT TAATTGATGA ACAGGGAAAA CTGACAGAGG AGATAAAAGC AAAAATAGAA AATGCTACAA CTTTACAAGA AGTAGAGGAC ATATACAGGC CTTTTAGGCC AAAAAGAAGG ACAAGGGCGA CCATTGCGAA AGAAAAAGGC TTGGAACCTC TTGCAAAGGT TATTTCTTCA AATGATGTAA CAGATGGGGA TGTAGAAGAA TACGCAAAGC CTTATCTTAA TGAAAATGTT CCAACAGTTG AAGAAGCATA CCAAGGAGCA ATGGATATAA TTGCAGAGGA TATATCTGAT GACGCTGATA TAAGGAAATA CATAAGGAGT TTTACATGGA ATAACGGAAT TATAGTGACA CAGGCATTAA AAGAGGATAG GTCTCCTTAT GAGATGTATT ATGACTATAA AGAAGCTGTA AAGACAATAC CGCCTCATAG AATACTGGCG ATAAACAGGG CAGAAAGGGA AAAATACATA TCTGTAAAAA TTGAGATAGA TAGTGAAAGG ATAATAAATA GGCTTATAGA ATCAAAAGTA AATAAAGCTT CAATATTTGC AGAGTATTAC AAGAAAGCAA TTGAAGATTC CTACAAAAGG CTTATTGCGC CTTCAATAGA GAGAGAGATA AGGAATGCTC TTACGGAAAA AGCAGAAGAA AAGGCAATAA TAGTTTTTAA AGAAAATCTA AAAAGCCTTT TACTTCAACC CCCAATAAAA GGACATGTTG TCATGGGATT TGACCCAGCC TATAGGACAG GATGTAAAAT TGCTGTTGTA GATGAAACGG GAAAACTTTT GGACACTGCT ACAGTATATC CTACTCCTCC TCAAAATGAT TTTGAAAATT CTAAAAAGGT TTTAAAAGAG CTAATAGAAA AATACAATGT CACTTTAATT GCTTTGGGAA ATGGAACTGC TTCAAGAGAG AGTGAAATGT TTATAGCAGA GCTTATAAAA GAACTTTCAA GAGAGGTAAA GTATGTGATA GTGAATGAAG CAGGGGCATC AGTCTATTCT GCTTCTCAAA TAGGTACAGA GGAATTTCCT GATATAAACG TAAGTTTAAG AGGAGCTATC TCACTGGCGC GAAGGCTTCA AGACCCATTG GCAGAGCTTG TAAAAATTGA TCCAAAGTCC ATCGGGGTAG GACAGTATCA ACACGATGTA GACCAGAAAA AACTGGGAGA TGCTTTAAAC GGTGTTGTAG AAGATTGTGT AAACAGCGTA GGTGTTGATT TAAACACTGC ATCAGTATCT CTTTTAAAAT ACGTTTCAGG GATAAATGCT GCTATTGCCA AGAATATTGT TGAATATCGA AACCAAATAG GCAAGTTTAC AAATAGGGAG CAGCTTAAAA ATGTAAAAAG ATTAGGAGAT ACTACTTTTA CACAATGTGC GGGTTTTTTG AGAATTTTGG ACGGTGACAA TATCTTTGAC TCAACAGCAG TTCACCCAGA GCGTTACGAA ACCTTAGAAA AGCTTTTGAG AAAATTTAGA TATGAGAAAG AAAAACTAGA TAGAAAAAAG TTAAAAGACT TTGCTAATTC TTTAGAAGAA TACGGATTAG AGAAGATTTC AGAGGAATAC GATATAGGAC TTCCTACACT TTATGACATA GTTTCAGAGC TTAAAAAACC TGGAAGAGAC CCAAGAGAAG ACTTACCCAA GCCCATTTTA AGGTCTGATG TAATGACTAT AAATGAGCTA AAACCAGGCA TGGAACTTAT GGGAACAGTT AGGAATGTCA CTGATTTTGG CTGCTTTGTA GATATAGGTG TTCACACTGA TGGGCTTGTT CACATTTCTG AGATGTCTCA AAATTACATA AAACATCCTC TTGATGTGGT TTCAGTGGGA GACATTGTAA AAGTTAGAGT TTTAAGTGTT GATATTGAGA GAAATAGAAT TTCTCTTTCA ATGAAATAA
|
Protein sequence | MNRIKDILKK EFGLKDFQVE NTIKLIDEGN TIPFIARYRK EATGSLSDEV LRNFYDRLTY LRNLEEKKQD TIRLIDEQGK LTEEIKAKIE NATTLQEVED IYRPFRPKRR TRATIAKEKG LEPLAKVISS NDVTDGDVEE YAKPYLNENV PTVEEAYQGA MDIIAEDISD DADIRKYIRS FTWNNGIIVT QALKEDRSPY EMYYDYKEAV KTIPPHRILA INRAEREKYI SVKIEIDSER IINRLIESKV NKASIFAEYY KKAIEDSYKR LIAPSIEREI RNALTEKAEE KAIIVFKENL KSLLLQPPIK GHVVMGFDPA YRTGCKIAVV DETGKLLDTA TVYPTPPQND FENSKKVLKE LIEKYNVTLI ALGNGTASRE SEMFIAELIK ELSREVKYVI VNEAGASVYS ASQIGTEEFP DINVSLRGAI SLARRLQDPL AELVKIDPKS IGVGQYQHDV DQKKLGDALN GVVEDCVNSV GVDLNTASVS LLKYVSGINA AIAKNIVEYR NQIGKFTNRE QLKNVKRLGD TTFTQCAGFL RILDGDNIFD STAVHPERYE TLEKLLRKFR YEKEKLDRKK LKDFANSLEE YGLEKISEEY DIGLPTLYDI VSELKKPGRD PREDLPKPIL RSDVMTINEL KPGMELMGTV RNVTDFGCFV DIGVHTDGLV HISEMSQNYI KHPLDVVSVG DIVKVRVLSV DIERNRISLS MK
|
| |