Gene Smed_1847 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1847 
Symbol 
ID5322705 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1922970 
End bp1924259 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content63% 
IMG OID640790785 
Productintegrase catalytic region 
Protein accessionYP_001327517 
Protein GI150397050 
COG category[L] Replication, recombination and repair 
COG ID[COG4584] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.154194 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAGCG CGATTTTATC GCATCCTGCA TTGTCGCTAA CGATGCAGGG CGGGGATATG 
CTTCAGGCTG ACGAGGTGGT GGCAATGCTG CGGCTGCACG AGCTTGGTTG GGGTAGCAAA
CGTCTATCGA AAGAATTTGG ATGCGCGCGG AATACCGTTC GCCGATATCT TCGAGAGGGC
GGAGCTGTAC CGTTTAAACA GCCTGCCCGG CGCAGTGCGT TCGACGGGCT TGATGATTGG
CTTCGCGAGC GTTTTTTCCG GCATGACGGT AATGCGGATG TGATCCGCCA AGAGTTGGCG
AGCGAGCACG GAATTGTCAT CGGTCTGCGT TCTGTGGAGC TCCGCGTACG GCAGTGGCGG
CGAGAGCTAA AGGCACAGAA GCGGGCGACG GTCCGCTTCG AGACGGCGCC GGGCCATCAG
ATGCAGATCG ACTTCGGTGA CACGAAGGTG TGGATCGGCG ACGAGCGGGT TCGGATTCAC
CTGTTCGTGG GGACGCTGGG GTATTCGCGG CGGATGCATG CTCGCGCGTC ACTCAGGGAG
CGCCAGGCAG ACTGGTTTGA AGGCATGGAA GGCGCTTTCC TGCGGTTCGG CGGGGTTCCG
GTGGAAGTGC TGATCGACAA TGCGAAGGCC CTGGTCGAAC ATCATGATCC GGTGACGCGA
GAGGTGAGAT TCAACGCGCG ACTGCATGCT TTCGCCCGTT ATTGGGGCTT CACGCCGCGG
GCCTGCGCAC CGTATCGGGC GAGAACGAAA GGCAAAGACG AGCGCGGGGT CGGTTACGTC
AAGAAGAACG CGATCGCCGG GCGCCGCTTC GAGAGCTGGG CCGAGTTTGA AGCGCATCTG
GATCGATGGA CACGCGAAGT TGCCGACCAG CGTGAACACG GCACCACCGG TGTCAAACCG
GCGGAACGCT TTGCCGACGA AGCCAGGGAG CTGCGCCCGC TGGCCGGACG GGCACCCTTC
GGGCAATTGC GGGATCTGGT TCGCAAGGTT CAAGCCGATT GCGCGATCGA CCTCGACACC
AACAGCTACT CGGTGCCCTG GCGCCTGATC GGCGAGAGTG TTCAGGTCGT GGTGTTGGCG
GGGCGCGTCA TCATCCGGCA TGCGGGCCAG GTGGTGGCTG ATCATGCCCT GTGCGATGGA
CGACGACAAC GGATCGTGGA CCGGGCGCAT TTTGTCGGTG TTGCCGGCGC CGAGGGTCTG
GTGCGAGCCG CCGCTCCCAT CGAGCTCCCC CCTCCCACCC TGTTGCGCCC GCTTGCGGAA
TACGAGGCGG TTGCCGGAGG AGGCTGGTGA
 
Protein sequence
MTSAILSHPA LSLTMQGGDM LQADEVVAML RLHELGWGSK RLSKEFGCAR NTVRRYLREG 
GAVPFKQPAR RSAFDGLDDW LRERFFRHDG NADVIRQELA SEHGIVIGLR SVELRVRQWR
RELKAQKRAT VRFETAPGHQ MQIDFGDTKV WIGDERVRIH LFVGTLGYSR RMHARASLRE
RQADWFEGME GAFLRFGGVP VEVLIDNAKA LVEHHDPVTR EVRFNARLHA FARYWGFTPR
ACAPYRARTK GKDERGVGYV KKNAIAGRRF ESWAEFEAHL DRWTREVADQ REHGTTGVKP
AERFADEARE LRPLAGRAPF GQLRDLVRKV QADCAIDLDT NSYSVPWRLI GESVQVVVLA
GRVIIRHAGQ VVADHALCDG RRQRIVDRAH FVGVAGAEGL VRAAAPIELP PPTLLRPLAE
YEAVAGGGW