Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_0483 |
Symbol | |
ID | 7978632 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 537807 |
End bp | 539066 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 644797460 |
Product | RNA-directed DNA polymerase (Reverse transcriptase) |
Protein accession | YP_002948660 |
Protein GI | 239826036 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3344] Retron-type reverse transcriptase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000125765 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGATGA AACAGATACT GTCACGGGAG AATCTCCTGC GAGCACTCAA ACAAGTGGAA AAGAATAAAG GGTCCCATGG AACCGATGGA ATGTCCGTCA AAGACCTGCG AAGACACCTC GTGGAACATT GGGACGTGAT ACGGCGTGCT TTGGAAGAAG GGACCTACGA ACCTTGCCCG GTCCGACGGG TCGAAATCCC GAAACCGAAC GGAGGAGTCA GGTTACTAGG AATCCCGACC GTGACAGACC GGTTCATCCA ACAGGCCATC GCCCAAGTGC TCACGCCGAT CTTTGACCCA TCCTTTTCGG AACACAGCTA CGGGTTTCGT CCCGGTCGAA GAGGACACGA CGCGGTGAAA AAGGCGAAGC AGTATATTCA GGAAGGATAT ACATGGGTGG TAGATATCGA CTTGGAAAAG TTCTTTGATC GAGTCAACCA TGACAAACTG ATGGGGATAT TAGCGAAACG AATTCCAGAC AAAATCCTCC TAAAGTTGAT ACGGAAGTAT TTACAGGCAG GGGTCATGAT CAACGGGGTG GTCATGGAAA CACAAGAGGG GACTCCACAA GGAGGGCCGC TCAGTCCACT TTTGTCCAAC ATTCTCTTGG ATGAGCTGGA CAAAGAATTG GAAAAACGAG GGCACAAATT TGTACGGTAT GCGGATGACT GCAATATCTA CGTAAGGACG AAGAAGGCCG GGGAACGGGT GATGAAATCG ATCACGGCAT TCATTGAAAA GAAACTCCGG CTGAAAGTCA ACGAAACCAA ATCGGCAGTG GATCGACCGT GGAGGAGAAA ATTCCTCGGT TTTAGCTTCA CCCCAAATAA GGAGCCAAAA ATCCGGATCG CAAAGGAAAG TATTCGGCGC ATGAAGCAAA GGATGCGCAC CATGACGAGT CGATCGAAAC CGATTCCCAT GCTCGAGCGA ATCGAACAGC TCAACCAGTA CATTCTGGGA TGGTGTGGAT ACTTCTCGTT AGCAGAGACT CCAAGTGTGT TCAAAGAACT AGATGGATGG ATTCAACGAA GGCTGCGCAT GTGCCAATGG AAAGAGTGGA AACTTCCGAG AACCAGAGTC CGAAAACTGC AAAGTTTAGG AGTGCCCAAG CGGAAAGCAT ATGAATGGGG AAACACTCGG AAGAAATATT GGAGAGTGGC CGCTAGTCCC ATCTTGCATA AAGCCCTTGG CAACTCCTAT TGGGAGAGCC AAGGGCTGAA GAGTCTTTAT CAACGATATG AATCTCTGCG TCAGACTTAA
|
Protein sequence | MWMKQILSRE NLLRALKQVE KNKGSHGTDG MSVKDLRRHL VEHWDVIRRA LEEGTYEPCP VRRVEIPKPN GGVRLLGIPT VTDRFIQQAI AQVLTPIFDP SFSEHSYGFR PGRRGHDAVK KAKQYIQEGY TWVVDIDLEK FFDRVNHDKL MGILAKRIPD KILLKLIRKY LQAGVMINGV VMETQEGTPQ GGPLSPLLSN ILLDELDKEL EKRGHKFVRY ADDCNIYVRT KKAGERVMKS ITAFIEKKLR LKVNETKSAV DRPWRRKFLG FSFTPNKEPK IRIAKESIRR MKQRMRTMTS RSKPIPMLER IEQLNQYILG WCGYFSLAET PSVFKELDGW IQRRLRMCQW KEWKLPRTRV RKLQSLGVPK RKAYEWGNTR KKYWRVAASP ILHKALGNSY WESQGLKSLY QRYESLRQT
|
| |