Gene Smed_5624 details


Gene Information

Locus tag: Smed_5624
Symbol:
ID: 5319926
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 592144
End bp: 593433
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 63%
IMG OID: 640777367
Product: integrase catalytic region
Protein accession: YP_001314299
Protein GI: 150377704
COG category: [L] Replication, recombination and repair
COG ID: [COG4584] Transposase and inactivated derivatives
TIGRFAM ID:
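
The length fields above are consistent with the coordinates. The following is a minimal sketch of that arithmetic, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the untranslated stop codon


length = gene_length_bp(592144, 593433)
print(length)                     # 1290
print(protein_length_aa(length))  # 429
```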


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.205652
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.723097
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAGCG CGATTTTATC GCATCCTGCA TTGTCGCTAA CGATGCAGGG CGGGGATATG 
CTTCAGGCTG ACGAGGTGGT GGCAATGCTG CGGCTGCACG AGCTTGGTTG GGGTAGCAAA
CGTCTATCGA AAGAATTTGG ATGCGCGCGG AATACCGTTC GCCGATATCT TCGAGAGGGC
GGAGCTGTAC CGTTTAAACA GCCTGCCCGG CGCAGTGCGT TCGACGGGCT TGATGATTGG
CTTCGCGAGC GTTTTTTCCG GCATGACGGT AATGCGGATG TGATCCGCCA AGAGTTGGCG
AGCGAGCACG GAATTGTCAT CGGTCTGCGT TCTGTGGAGC TCCGCGTACG GCAGTGGCGG
CGAGAGCTAA AGGCACAGAA GCGGGCGACG GTCCGCTTCG AGACGGCGCC GGGCCATCAG
ATGCAGATCG ACTTCGGTGA CACGAAGGTG TGGATCGGCG ACGAGCGGGT TCGGATTCAC
CTGTTCGTGG GGACGCTGGG GTATTCGCGG CGGATGCATG CTCGCGCGTC ACTCAGGGAG
CGCCAGGCAG ACTGGTTTGA AGGCATGGAA GGCGCTTTCC TGCGGTTCGG CGGGGTTCCG
GTGGAAGTGC TGATCGACAA TGCGAAGGCC CTGGTCGAAC ATCATGATCC GGTGACGCGA
GAGGTGAGAT TCAACGCGCG ACTGCATGCT TTCGCCCGTT ATTGGGGCTT CACGCCGCGG
GCCTGCGCAC CGTATCGGGC GAGAACGAAA GGCAAAGACG AGCGCGGGGT CGGTTACGTC
AAGAAGAACG CGATCGCCGG GCGCCGCTTC GAGAGCTGGG CCGAGTTTGA AGCGCATCTG
GATCGATGGA CACGCGAAGT TGCCGACCAG CGTGAACACG GCACCACCGG TGTCAAACCG
GCGGAACGCT TTGCCGACGA AGCCAGGGAG CTGCGCCCGC TGGCCGGACG GGCACCCTTC
GGGCAATTGC GGGATCTGGT TCGCAAGGTT CAAGCCGATT GCGCGATCGA CCTCGACACC
AACAGCTACT CGGTGCCCTG GCGCCTGATC GGCGAGAGTG TTCAGGTCGT GGTGTTGGCG
GGGCGCGTCA TCATCCGGCA TGCGGGCCAG GTGGTGGCTG ATCATGCCCT GTGCGATGGA
CGACGACAAC GGATCGTGGA CCGGGCGCAT TTTGTCGGTG TTGCCGGCGC CGAGGGTCTG
GTGCGAGCCG CCGCTCCCAT CGAGCTCCCC CCTCCCACCC TGTTGCGCCC GCTTGCGGAA
TACGAGGCGG TTGCCGGAGG AGGCTGGTGA
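
A GC content figure like the one in the Gene Information section is normally the fraction of G and C bases in the nucleotide sequence. The sketch below shows one way to compute it from a sequence pasted in the space-separated, 10-base block layout used above. It is an assumption that the reported value was derived this way, and `gc_fraction` is a hypothetical helper name.

```python
def gc_fraction(raw_sequence: str) -> float:
    """Fraction of G/C bases, ignoring spaces and line breaks in the input."""
    seq = "".join(raw_sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Usage: paste the full gene sequence (blocks and line breaks are fine)
# and format the result as a percentage.
example = "ATGC GGCC\nATAT"
print(f"{gc_fraction(example):.0%}")
```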
 
Protein sequence
MTSAILSHPA LSLTMQGGDM LQADEVVAML RLHELGWGSK RLSKEFGCAR NTVRRYLREG 
GAVPFKQPAR RSAFDGLDDW LRERFFRHDG NADVIRQELA SEHGIVIGLR SVELRVRQWR
RELKAQKRAT VRFETAPGHQ MQIDFGDTKV WIGDERVRIH LFVGTLGYSR RMHARASLRE
RQADWFEGME GAFLRFGGVP VEVLIDNAKA LVEHHDPVTR EVRFNARLHA FARYWGFTPR
ACAPYRARTK GKDERGVGYV KKNAIAGRRF ESWAEFEAHL DRWTREVADQ REHGTTGVKP
AERFADEARE LRPLAGRAPF GQLRDLVRKV QADCAIDLDT NSYSVPWRLI GESVQVVVLA
GRVIIRHAGQ VVADHALCDG RRQRIVDRAH FVGVAGAEGL VRAAAPIELP PPTLLRPLAE
YEAVAGGGW
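
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the amino acid string that NCBI publishes for translation table 11 (for sense and stop codons it is the same as the standard table; table 11 differs only in its permitted start codons). The function name `translate_cds` is illustrative, and the handling of alternative start codons is deliberately left out because this gene starts with ATG.

```python
from itertools import product

# NCBI table 11 amino acids, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(raw_sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Spaces and line breaks in the input are ignored. Trailing bases that do
    not fill a complete codon are not translated.
    """
    seq = "".join(raw_sequence.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First and last 30 bases of the gene sequence above.
print(translate_cds("ATGACGAGCG CGATTTTATC GCATCCTGCA"))  # MTSAILSHPA
print(translate_cds("TACGAGGCGG TTGCCGGAGG AGGCTGGTGA"))  # YEAVAGGGW
```

Both fragments agree with the start and end of the protein sequence listed above.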