Gene Smed_1223 details


Gene Information

Locus tag: Smed_1223
Symbol:
ID: 5322070
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 1306022
End bp: 1307281
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 56%
IMG OID: 640790164
Product: RNA-directed DNA polymerase (Reverse transcriptase)
Protein accession: YP_001326908
Protein GI: 150396441
COG category: [L] Replication, recombination and repair
COG ID: [COG3344] Retron-type reverse transcriptase
TIGRFAM ID:
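
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminating stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1306022
END_BP = 1307281
GENE_LENGTH_BP = 1260
PROTEIN_LENGTH_AA = 419


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)

print(computed_length == GENE_LENGTH_BP)       # True
print(computed_protein == PROTEIN_LENGTH_AA)   # True
```

Under these assumptions, 1307281 - 1306022 + 1 gives 1260 bp, which is 420 codons, or 419 residues once the stop codon is excluded.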


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0131896
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.0593975
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTTCGA TAGATACGAC AGACAAGCCG TTTCGAATTG AGAAGCGGCA AGTGTACGAA 
GCTTACAAAG CGGTCAAAGC CAACCATGGC GCAGCCGGAG TGGACGGTGA GACCTTGGAG
ATGTTCGAGA AAGACCTTGC GAGAAATCTC TACAAGATCT GGAATCGGAT GTCGTCCGGG
ACCTACTTTC CGCCACCGGT GCGCGCCGTC TCCATTCCGA AGAAGACTGG TGGCGAAAGG
GTTTTGGGTG TGCCCACGGT CAGCGATCGG ATCGCGCAGA TGGTGGTCAA GCAGATGATT
GAGCCGGATT TGGATTCCCT CTTTCTTCCG GACTCCTACG GATACAGGCC GGGAAAATCG
GCGTTGGATG CCGTTGGGGT GACGCGTCAG CGGTGCTGGA AGTACGATTG GGTTCTGGAA
TTCGACATCA AAGGGCTGTT TGACAATCTT CCGCATGATC TCTTGCTGAA GGCGGTCAGA
AAGCACGTCA AATGCAACTG GGCTCTGCTC TACATCGAAA GATGGCTGGT CGCGCCCATG
GAAAAGAACG GAGCAGTCAT TGAGCGCACA CGTGGTACCC CGCAAGGGGG CGTGGTCAGC
CCAATCCTCT CGAATCTCTT CCTGCATTAC GCGTTCGACG TCTGGATGAC TCGGACGCAC
CCTGATCTTC CATGGTGTCG GTATGCCGAT GATGGTCTCG TGCACTGCCG GACCGAGCAA
GAAGCACAGG CCCTCAAGGC TGCGCTTCAA GCCCGGCTGG CAGAATGCGG ACTTCAGATG
CATCCGATCA AGACCCAGAT CGTCTACTGC AAAGATAATC GGCGTCGGAA AAGGTATCCG
ACCGTCAAAT TTGACTTCCT TGGATACCAA TTCCGGCCGC GACAGGTGGC GACGGCGCAG
CAGGATGAGT TCTTCTGCGG CTACACCCCG GCGGCCAGCC CGACGGCGCT AAAGTCGATG
CGGGCCACGA TCAAGAGCTT GAACATTCCG CGGCAAACGC CGGGGACGCT GGCTGAAATC
GCCAAACAGA TCAATCCGCT CCTGCGGGGA TGGATTGCCT ATTATGGGCG GTTCAGTCGT
TCGGCCCTGT TCTCTCTGGC TGACTACATC AATCGGAAGC TCAAGGCCTG GATTATGCGA
AAGTACAAGC GCTTTCGGTT CCACAAAACT CGGGCTTCGC AGTTCTTGCG GCAACTTGCT
CGAGATAATC GGGGCCTCTT CGTACACTGG CAGGCGTTCG GAACGAACCT GTTTGCCTGA
 
Protein sequence
MTSIDTTDKP FRIEKRQVYE AYKAVKANHG AAGVDGETLE MFEKDLARNL YKIWNRMSSG 
TYFPPPVRAV SIPKKTGGER VLGVPTVSDR IAQMVVKQMI EPDLDSLFLP DSYGYRPGKS
ALDAVGVTRQ RCWKYDWVLE FDIKGLFDNL PHDLLLKAVR KHVKCNWALL YIERWLVAPM
EKNGAVIERT RGTPQGGVVS PILSNLFLHY AFDVWMTRTH PDLPWCRYAD DGLVHCRTEQ
EAQALKAALQ ARLAECGLQM HPIKTQIVYC KDNRRRKRYP TVKFDFLGYQ FRPRQVATAQ
QDEFFCGYTP AASPTALKSM RATIKSLNIP RQTPGTLAEI AKQINPLLRG WIAYYGRFSR
SALFSLADYI NRKLKAWIMR KYKRFRFHKT RASQFLRQLA RDNRGLFVHW QAFGTNLFA
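
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a sketch under stated assumptions, not the tool that generated this record: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, which this sketch does not model). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    bases = clean(seq)
    protein = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGACTTCGA TAGATACGAC AGACAAGCCG"))  # MTSIDTTDKP
```

Passing the full gene sequence to `translate` should yield the protein sequence listed above, and `gc_content` on the same input can be compared with the GC content field.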