Gene Smed_4903 details


Gene Information

Locus tag: Smed_4903
Symbol:
ID: 5317880
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1412815
End bp: 1414074
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 56%
IMG OID: 640776687
Product: RNA-directed DNA polymerase (Reverse transcriptase)
Protein accession: YP_001313619
Protein GI: 150377023
COG category: [L] Replication, recombination and repair
COG ID: [COG3344] Retron-type reverse transcriptase
TIGRFAM ID:
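
The reported lengths can be cross-checked against the coordinates. The following is a minimal sketch, assuming that the Start bp and End bp values are 1-based and inclusive and that the final codon is an untranslated stop codon. Both are assumptions, although they agree with the figures listed above. The variable names are illustrative only.

```python
# Values copied from the Gene Information entries above.
start_bp = 1412815
end_bp = 1414074

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # prints "1260 419"
```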


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.791512
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.702459
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTTCGA TAGATACGAC AGACAAGCCG TTTCGAATTG AGAAGCGGCA AGTGTACGAA 
GCTTACAAAG CGGTCAAAGC CAACCATGGC GCAGCCGGAG TGGACGGTGA GACCTTGGAG
ATGTTCGAGA AAGACCTTGC GAGAAATCTC TACAAGATCT GGAATCGGAT GTCGTCCGGG
ACCTACTTTC CGCCACCGGT GCGCGCCGTC TCCATTCCGA AGAAGACTGG TGGCGAAAGG
GTTTTGGGTG TGCCCACGGT CAGCGATCGG ATCGCGCAGA TGGTGGTCAA GCAGATGATT
GAGCCGGATT TGGATTCCCT CTTTCTTCCG GACTCCTACG GATACAGGCC GGGAAAATCG
GCGTTGGATG CCGTTGGGGT GACGCGTCAG CGGTGCTGGA AGTACGATTG GGTTCTGGAA
TTCGACATCA AAGGGCTGTT TGACAATCTT CCGCATGATC TCTTGCTGAA GGCGGTCAGA
AAGCACGTCA AATGCAACTG GGCTCTGCTC TACATCGAAA GATGGCTGGT CGCGCCCATG
GAAAAGAACG GAGCAGTCAT TGAGCGCACA CGTGGTACCC CGCAAGGGGG CGTGGTCAGC
CCAATCCTCT CGAATCTCTT CCTGCATTAC GCGTTCGACG TCTGGATGAC TCGGACGCAC
CCTGATCTTC CATGGTGTCG GTATGCCGAT GATGGTCTCG TGCACTGCCG GACCGAGCAA
GAAGCACAGG CCCTCAAGGC TGCGCTTCAA GCCCGGCTGG CAGAATGCGG ACTTCAGATG
CATCCGATCA AGACCCAGAT CGTCTACTGC AAAGATAATC GGCGTCGGAA AAGGTATCCG
ACCGTCAAAT TTGACTTCCT TGGATACCAA TTCCGGCCGC GACAGGTGGC GACGGCGCAG
CAGGATGAGT TCTTCTGCGG CTACACCCCG GCGGCCAGCC CGACGGCGCT AAAGTCGATG
CGGGCCACGA TCAAGAGCTT GAACATTCCG CGGCAAACGC CGGGGACGCT GGCTGAAATC
GCCAAACAGA TCAATCCGCT CCTGCGGGGA TGGATTGCCT ATTATGGGCG GTTCAGTCGT
TCGGCCCTGT TCTCTCTGGC TGACTACATC AATCGGAAGC TCAAGGCCTG GATTATGCGA
AAGTACAAGC GCTTTCGGTT CCACAAAACT CGGGCTTCGC AGTTCTTGCG GCAACTTGCT
CGAGATAATC GGGGCCTCTT CGTACACTGG CAGGCGTTCG GAACGAACCT GTTTGCCTGA
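
A GC content figure such as the one reported for this gene can be recomputed from the sequence text. The sketch below assumes the value is the simple fraction of G and C bases over the full gene sequence; the page does not state how it was derived. The function name `gc_fraction` is hypothetical, and the sample input is only the first line of the sequence, so its result is not expected to equal the whole-gene value.

```python
def gc_fraction(seq_text: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base blocks and line breaks used in this listing)
    is removed before counting.
    """
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used here only as sample input.
first_line = "ATGACTTCGA TAGATACGAC AGACAAGCCG TTTCGAATTG AGAAGCGGCA AGTGTACGAA"
print(f"{gc_fraction(first_line):.0%}")
```

Passing the complete gene sequence to `gc_fraction` gives the whole-gene value for comparison with the listed percentage.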
 
Protein sequence
MTSIDTTDKP FRIEKRQVYE AYKAVKANHG AAGVDGETLE MFEKDLARNL YKIWNRMSSG 
TYFPPPVRAV SIPKKTGGER VLGVPTVSDR IAQMVVKQMI EPDLDSLFLP DSYGYRPGKS
ALDAVGVTRQ RCWKYDWVLE FDIKGLFDNL PHDLLLKAVR KHVKCNWALL YIERWLVAPM
EKNGAVIERT RGTPQGGVVS PILSNLFLHY AFDVWMTRTH PDLPWCRYAD DGLVHCRTEQ
EAQALKAALQ ARLAECGLQM HPIKTQIVYC KDNRRRKRYP TVKFDFLGYQ FRPRQVATAQ
QDEFFCGYTP AASPTALKSM RATIKSLNIP RQTPGTLAEI AKQINPLLRG WIAYYGRFSR
SALFSLADYI NRKLKAWIMR KYKRFRFHKT RASQFLRQLA RDNRGLFVHW QAFGTNLFA
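
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a self-contained illustration rather than the method used to generate this record. It relies on the fact that translation table 11 assigns the same amino acids as the standard genetic code and differs in the permitted start codons; alternative start codons are not handled here, which does not matter for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids listed in NCBI's TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at a stop codon."""
    seq = "".join(seq_text.split()).upper()  # drop block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above (both start in frame).
first_line = "ATGACTTCGA TAGATACGAC AGACAAGCCG TTTCGAATTG AGAAGCGGCA AGTGTACGAA"
last_line = "CGAGATAATC GGGGCCTCTT CGTACACTGG CAGGCGTTCG GAACGAACCT GTTTGCCTGA"
print(translate(first_line))  # prints "MTSIDTTDKPFRIEKRQVYE"
print(translate(last_line))   # prints "RDNRGLFVHWQAFGTNLFA"
```

Both outputs match the start and end of the protein sequence listed above, and the final TGA codon is the stop that is excluded from the 419 aa length.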