Gene Smed_1348 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1348 
Symbol 
ID5322196 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1432667 
End bp1433842 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content57% 
IMG OID640790290 
Productphage integrase family protein 
Protein accessionYP_001327033 
Protein GI150396566 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAGGA ACCTTCTCAC CGTCACTGAG ATCAAGAATT CCCCGAAGCC GAAACTTCGG 
GACGGAGATG GCTTGTGGCT GCACACCAGT AGCTCCGGAA ACCGCCATTT TGTGTTCATC
TATATAAGGC ATGGGCGTCG CCGCGAAATG GGCCTAGGCA CATATGGAAC TGGAACCGGC
CAGGTTAGTC TCGCAGCCGC CCGAACAAAA GCGGAGGAGA TACGAACCAT CCTCGGGCGC
GGAGGCGACC CATTCACCGA GATGGGGGAG CGCCAAGAAA AGGTGAAGCC CACAACTTTC
GGACAATGCG CCGATGATCT CGTTGACGCC ATGGAATCGC AATGGCGAAA TGAAAAGCAC
CGCGCGCAGT GGCGAATGAC GCTGACCGAA TACGCGAAGG CGATTCGAAA GCTACCCGTT
GCGGAGGTGA CGACCGATGA CGTTGTTCGT GTTCTGAAGC CAATATGGAG CACAAAGGCG
GAGACGGCAT CTCGTCTTCG CGGTCGAATT GAAAAGGTGC TCGACCACGC GAAAGTCCGC
GGCCTGCGAA CTGGAGAGAA CCCAGCTCGA TGGAAAGGAC ACCTCGACCA TATCCTGCCA
AAGGCCGGGA AGCTGAAGCG AGGACATCAC GCCGCGATGC CGTATGCGGA CGTGCCGGCC
TTCATTAAGA AGATTCGAGA GGCGTCCGGA GTTGGCGCCC GCGCGCTCGA GTTCACTATT
CTGACTGCGT CACGTACGGG CGAAACCATG GGAGCCAAGT GGGCTGAATT CGACTTCAGG
GAAAACGTAT GGACCGTACC CGCCGAGCGG ATGAAGGGGG GACGCGAACA TCGGGTTCCC
TTGACCGACC GAGTTCTGGC CGTACTTACC GAGATGAAGA AGCGATCGGT CAATGACTTC
GTGTTCCCCG GTTCGAAAGC AAACACGCCG ATTAGCAACA TGACGATGAC CAAGGTTATG
AAAACGTATG AGGCAGACGC CTTTACCGTG CATGGTTTTA GGTCAGCCTT CCGGGACTGG
GCATCCGAGG AGACCGAATT CCAGGGCGAG GTCGCCGAGG CCGCATTGGC TCATATTACA
GGCGACGAGA CCGAGCGCGC CTATCGCCGC GGCGATGTTC TGGAAAAGCG TCGGAAACTG
ATGGAGGCGT GGGAGACGTA CTGCGAAGTA GTGTAA
 
Protein sequence
MARNLLTVTE IKNSPKPKLR DGDGLWLHTS SSGNRHFVFI YIRHGRRREM GLGTYGTGTG 
QVSLAAARTK AEEIRTILGR GGDPFTEMGE RQEKVKPTTF GQCADDLVDA MESQWRNEKH
RAQWRMTLTE YAKAIRKLPV AEVTTDDVVR VLKPIWSTKA ETASRLRGRI EKVLDHAKVR
GLRTGENPAR WKGHLDHILP KAGKLKRGHH AAMPYADVPA FIKKIREASG VGARALEFTI
LTASRTGETM GAKWAEFDFR ENVWTVPAER MKGGREHRVP LTDRVLAVLT EMKKRSVNDF
VFPGSKANTP ISNMTMTKVM KTYEADAFTV HGFRSAFRDW ASEETEFQGE VAEAALAHIT
GDETERAYRR GDVLEKRRKL MEAWETYCEV V