Gene GWCH70_1723 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1723 
Symbol 
ID7978645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1801716 
End bp1802978 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content41% 
IMG OID644798577 
ProductRNA-directed DNA polymerase (Reverse transcriptase) 
Protein accessionYP_002949749 
Protein GI239827125 
COG category[L] Replication, recombination and repair 
COG ID[COG3344] Retron-type reverse transcriptase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTTTGA ATCAAATCCT GTCACGGGAG AATGTGCTTC AAGCACTAAA ACGTGTAGAA 
CAGAATAAAG GAAGCCACGG AGTAGATATG ATGCCCGTAC AAAATCTACG ACAGCACATA
GTTGAAAACT GGCTATCGAT TAAAGAAGCA ATTCTCAAGG GAACTTATGA ACCGATGCCA
GTCCGCAGAG TCGAAATCCC GAAACCTGAT GGCGGAGTTC GCTTACTAGG AATCCCTACC
GTAACAGACC GTTTGATTCA ACAAGCAATC GCCCAAGTAC TTTCAAAAGT GTATGACCCT
ACATTCTCTG AAAACAGCTA CGGATTTCGA CCAAACCGAA GTGCTCATGA TGCGGTGAGG
AAAGCGAAAG AATATATAAG AGATGGATAT CGATGGGTTG TAGATATGGA CTTGGAGAAA
TTCTTTGATA AGGTCAACCA TGACAGATTA ATGGGTACAC TCGCGAAGAG AATCCAAGAT
AAACCATTAC TGAAATTGAT TCGTAAGTAT TTACAATCGG GCGTCATGAT TGATGGTGTG
GTGTCAAGCA CATTAGAAGG AACTCCACAA GGAGGACCAT TAAGTCCGCT ACTATCTAAC
ATTGTACTAG ATGAACTAGA TAAAGAATTG GAAAGAAGAG GACACAAATT CGTTCGATAT
GCGGATGACT GTAACATTTA CGTGAAAAGT AAACGAGCAG GACTTCGCAC AATGGCAAGT
ATCCAGCGAT TTATTGAAGG AACACTACGA CTGAAAGTAA ATGAAAAGAA ATCAGCGGTC
GACCGTCCAT GGAAACGTAA GTTTCTAGGA TTTAGCTTTA CCTATCATAA AGAGCCAAAG
GTTCGTATCG CAAAAGAAAG CCTTAAACGA ATGAAGAATA AAGTTCGTGA AATCACATCA
CGCAAGATGC CCTACCCGAT GGAATACCGC ATTCAGAAAC TGAATCAATA TCTAATGGGA
TGGTGTGGAT ATTTTGCGCT AGCAGACACC AAATCTATAT TCCCTGAATT AGATAAATGG
ATTCGTAGAA GACTTCGAAT GTGTCTATGG AAGAACTGGA AGAAACCGAA AACAAAGATA
CGCAACCTTA TTCAACTTGG CGTACCACAA TGGCAAGCGT ATGAATGGGG AAATACTCGG
AAGAGTTATT GGCGTATTTC AAAAAGTCCA ATATTACACA GAACCCTTGG TAACTCCTAT
TGGAGAAACC AAGGGCTGAA AAGTCTTGAA GCTCGTTATG AAAACTTGCG TCAATTATCT
TAA
 
Protein sequence
MLLNQILSRE NVLQALKRVE QNKGSHGVDM MPVQNLRQHI VENWLSIKEA ILKGTYEPMP 
VRRVEIPKPD GGVRLLGIPT VTDRLIQQAI AQVLSKVYDP TFSENSYGFR PNRSAHDAVR
KAKEYIRDGY RWVVDMDLEK FFDKVNHDRL MGTLAKRIQD KPLLKLIRKY LQSGVMIDGV
VSSTLEGTPQ GGPLSPLLSN IVLDELDKEL ERRGHKFVRY ADDCNIYVKS KRAGLRTMAS
IQRFIEGTLR LKVNEKKSAV DRPWKRKFLG FSFTYHKEPK VRIAKESLKR MKNKVREITS
RKMPYPMEYR IQKLNQYLMG WCGYFALADT KSIFPELDKW IRRRLRMCLW KNWKKPKTKI
RNLIQLGVPQ WQAYEWGNTR KSYWRISKSP ILHRTLGNSY WRNQGLKSLE ARYENLRQLS