Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1723 |
Symbol | |
ID | 7978645 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 1801716 |
End bp | 1802978 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 644798577 |
Product | RNA-directed DNA polymerase (Reverse transcriptase) |
Protein accession | YP_002949749 |
Protein GI | 239827125 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3344] Retron-type reverse transcriptase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTTTGA ATCAAATCCT GTCACGGGAG AATGTGCTTC AAGCACTAAA ACGTGTAGAA CAGAATAAAG GAAGCCACGG AGTAGATATG ATGCCCGTAC AAAATCTACG ACAGCACATA GTTGAAAACT GGCTATCGAT TAAAGAAGCA ATTCTCAAGG GAACTTATGA ACCGATGCCA GTCCGCAGAG TCGAAATCCC GAAACCTGAT GGCGGAGTTC GCTTACTAGG AATCCCTACC GTAACAGACC GTTTGATTCA ACAAGCAATC GCCCAAGTAC TTTCAAAAGT GTATGACCCT ACATTCTCTG AAAACAGCTA CGGATTTCGA CCAAACCGAA GTGCTCATGA TGCGGTGAGG AAAGCGAAAG AATATATAAG AGATGGATAT CGATGGGTTG TAGATATGGA CTTGGAGAAA TTCTTTGATA AGGTCAACCA TGACAGATTA ATGGGTACAC TCGCGAAGAG AATCCAAGAT AAACCATTAC TGAAATTGAT TCGTAAGTAT TTACAATCGG GCGTCATGAT TGATGGTGTG GTGTCAAGCA CATTAGAAGG AACTCCACAA GGAGGACCAT TAAGTCCGCT ACTATCTAAC ATTGTACTAG ATGAACTAGA TAAAGAATTG GAAAGAAGAG GACACAAATT CGTTCGATAT GCGGATGACT GTAACATTTA CGTGAAAAGT AAACGAGCAG GACTTCGCAC AATGGCAAGT ATCCAGCGAT TTATTGAAGG AACACTACGA CTGAAAGTAA ATGAAAAGAA ATCAGCGGTC GACCGTCCAT GGAAACGTAA GTTTCTAGGA TTTAGCTTTA CCTATCATAA AGAGCCAAAG GTTCGTATCG CAAAAGAAAG CCTTAAACGA ATGAAGAATA AAGTTCGTGA AATCACATCA CGCAAGATGC CCTACCCGAT GGAATACCGC ATTCAGAAAC TGAATCAATA TCTAATGGGA TGGTGTGGAT ATTTTGCGCT AGCAGACACC AAATCTATAT TCCCTGAATT AGATAAATGG ATTCGTAGAA GACTTCGAAT GTGTCTATGG AAGAACTGGA AGAAACCGAA AACAAAGATA CGCAACCTTA TTCAACTTGG CGTACCACAA TGGCAAGCGT ATGAATGGGG AAATACTCGG AAGAGTTATT GGCGTATTTC AAAAAGTCCA ATATTACACA GAACCCTTGG TAACTCCTAT TGGAGAAACC AAGGGCTGAA AAGTCTTGAA GCTCGTTATG AAAACTTGCG TCAATTATCT TAA
|
Protein sequence | MLLNQILSRE NVLQALKRVE QNKGSHGVDM MPVQNLRQHI VENWLSIKEA ILKGTYEPMP VRRVEIPKPD GGVRLLGIPT VTDRLIQQAI AQVLSKVYDP TFSENSYGFR PNRSAHDAVR KAKEYIRDGY RWVVDMDLEK FFDKVNHDRL MGTLAKRIQD KPLLKLIRKY LQSGVMIDGV VSSTLEGTPQ GGPLSPLLSN IVLDELDKEL ERRGHKFVRY ADDCNIYVKS KRAGLRTMAS IQRFIEGTLR LKVNEKKSAV DRPWKRKFLG FSFTYHKEPK VRIAKESLKR MKNKVREITS RKMPYPMEYR IQKLNQYLMG WCGYFALADT KSIFPELDKW IRRRLRMCLW KNWKKPKTKI RNLIQLGVPQ WQAYEWGNTR KSYWRISKSP ILHRTLGNSY WRNQGLKSLE ARYENLRQLS
|
| |