Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_1063 |
Symbol | |
ID | 7409620 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 1158451 |
End bp | 1160121 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643715429 |
Product | transcription termination factor Rho |
Protein accession | YP_002572937 |
Protein GI | 222529055 |
COG category | [K] Transcription |
COG ID | [COG1158] Transcription termination factor |
TIGRFAM ID | [TIGR00767] transcription termination factor Rho |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000384674 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCTGGGT CACTTGATGA GTTTTTAAAA GGAAAGTCTA TTATTGAACT TAGAGAAATA GCAAAAAGTC TGGGTATTCA GAAATATTCT TTGCTCAAAA AAGGTGAATT GATGGAAGCT ATTAGAAATT TTCTGGGAAG TTCAAAAGAA GATGCCACAG TTCTTGAGGG CATAGGCAAA AAAGAAAAAG GAAGAAGAGG AAGAAAAAAG AAATCAGAAA TTGCAGAAGT TCAGCAGACT TTTGAATTAA AGAGCGAGGA GCTACAATCA AAGGTTGGAG AGACCAATGA GAAGGAAGAG GAAAGACCAA GTGAAATGGA AAATAAAGAA GAAGATAACA TAAAAGAACA GACTTTGTCT GAGGATATAC AAAAAGAAGA AGAAAAAACT GAAATGCCAA CAGATGTTAG TTTAAATGAG GAGAAAAAAG AGGAGAGTAA AGTCATAGAG TTAAAACCTA AAAAAGAAGA AAAAAGAGAG GATAAAATGC AGATAGAAAT TCCGCCAGAA TTAAAAGAAC TTGAAGGTAA AGTGGAGATA GGCGGCATAG GCGAAGGAGT TTTGGAGATA ATTTATGAGC CTGGTGGCGG TGGCGGTTAC GGATTTTTGC GCGACGATTC GTTCGTTCCT GGCCCAAACG ACATATATGT TTCGCCATCT CAAATCAGAA AATTCAATCT CAAAACCGGA GATAAAATTA GAGGTCCCAT TAGGCTTCCG AAAGAAAATG AGAAGTTTGC AGGGCTTTTG TATGTTCAGA GCGTCAACGA TATGAAACCA GAAGAAGTTG CAAAACGCAC TCCTTTTGAA GACCTTACCC CTATTTTCCC AAACAAAAGA ATAATTTTGG AAAATAAAAA TGAGCCTAAG GATTTAGCAG TTAGACTCAT AGACCTTATT GCGCCAATTG GAAGAGGACA GAGAGGATTA ATTGTAGCAC CACCAAAAGC AGGTAAAACT ACGCTATTAA AGAAAATAGC AAATAGTATT CTGACAAACT ATGATGATTT GCATTTGATT GTACTGCTCA TTGATGAAAG ACCCGAAGAG GTCACTGACA TGCAAGATTC AATAAAAGCA GAGATACATT ACTCTACATT TGATGAAACT CCTGAACACC ACATAAAAGT TGCTGAAATG GTTTTAGAGA GGGCTATGAG ACTTGTTGAG TGTAAAAAAG ATGTTGTCAT TTTGTTAGAT AGCTTGACAA GGCTTGCACG TGCTTATAAC TTAGTTGAAC CACCTTCTGG CAGAACACTT TCTGGCGGTC TTGACCCGAA CGCTCTTCAT AAACCTAAAA AGTTTTTTGG TGCTGCAAGA AACCTTAAAG AAGGTGGTAG TCTTACTATC CTTGCAACAG CGTTGATTGA AACAGGGTCA CGAATGGATG ATGTCATATT TGAAGAGTTC AAGGGTACTG GCAACATGGA GTTGCACCTT GACAGAAAAC TTTCTGAAAA ACGAATATTC CCGGCTATTG ATATAAACAA GTCAGGGACG CGAAGAGAGG AACTTTTGCT GTCTGAAGAG GAAAAAGCAG CTGTTGATGC TATCAGAAGA GCACTTTCTA ACTTTGGAAC AGCTGAAACT ACAGAGAGAA TTATAAGTAT GCTTTCCCAA ACAAAGTCAA ATGAAGAATT CATAAGAAAG ATATTACAAA ATTTAAGATA A
|
Protein sequence | MPGSLDEFLK GKSIIELREI AKSLGIQKYS LLKKGELMEA IRNFLGSSKE DATVLEGIGK KEKGRRGRKK KSEIAEVQQT FELKSEELQS KVGETNEKEE ERPSEMENKE EDNIKEQTLS EDIQKEEEKT EMPTDVSLNE EKKEESKVIE LKPKKEEKRE DKMQIEIPPE LKELEGKVEI GGIGEGVLEI IYEPGGGGGY GFLRDDSFVP GPNDIYVSPS QIRKFNLKTG DKIRGPIRLP KENEKFAGLL YVQSVNDMKP EEVAKRTPFE DLTPIFPNKR IILENKNEPK DLAVRLIDLI APIGRGQRGL IVAPPKAGKT TLLKKIANSI LTNYDDLHLI VLLIDERPEE VTDMQDSIKA EIHYSTFDET PEHHIKVAEM VLERAMRLVE CKKDVVILLD SLTRLARAYN LVEPPSGRTL SGGLDPNALH KPKKFFGAAR NLKEGGSLTI LATALIETGS RMDDVIFEEF KGTGNMELHL DRKLSEKRIF PAIDINKSGT RREELLLSEE EKAAVDAIRR ALSNFGTAET TERIISMLSQ TKSNEEFIRK ILQNLR
|
| |