Gene Smed_5770 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5770 
Symbol 
ID5320072 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp737068 
End bp739362 
Gene Length2295 bp 
Protein Length764 aa 
Translation table11 
GC content61% 
IMG OID640777477 
ProductTPR repeat-containing protein 
Protein accessionYP_001314409 
Protein GI150377814 
COG category[R] General function prediction only 
COG ID[COG0457] FOG: TPR repeat 
TIGRFAM ID[TIGR01905] doubled CXXCH domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0786479 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.562645 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAGCGGT TTGTCCACGC AGCGACAGTT ATCCGGACAG CTTTGCTGTT TGGCGCCCTG 
CTCGCGTATG CCGCACCGGT TGGGGGTGCC GAGACGGCTC CGCCTTATGC TGCGGACCAG
CTAATCGCAC CCCAGAAGGG ATTCGTCGAC GAGAAGACGT GTACCTCCTG CCATGCGGAA
CAGGCCGCGG CATTCGCTAA GTCGCACCAC GCCAAGGCCA TGGCCCTTGC CGATGACAAA
TCGGTGCGCG CCGATTTCAA CAGCGTCCGG TTCGAGCGCG ATGGCGTGGC GGCAGAGTTC
TTCCGCCGCA ATGGTCGCTT CTTCATTCGC ACCGAAGGGC CGGGCGGCAA GCAGGCGGAC
TTTGAGGTCA AATATACCTT CGCATACGAA CCCCTGCAGC AGTACCTGGT CGATATCGGC
CACGGGCGAT TGCAGGCCTT TGATATCGCC TGGGACACTC AAAAACAGGA ATGGTTCTGG
CTGGGTGAAG GCAGTGCGGC AAAACCTGGC TCAACCTTTC ATTGGACGGG CCCGTTCTAT
CGTTGGAACC GCACCTGCAT CGACTGTCAT TCCACCGGTC CGCAGACCAA TTTCGAGCCG
CAGACGAACA AATACCAGTC GTCCTATGTC GCCACCAGCA TTGGCTGCCA GTCCTGCCAC
GGCGGCGGTG CGAAGCATGT CGAATGGGCC AGAGCCAAAG CGGCAAACGC TTCAGCGGCG
GCGGCAGATC CCGGCCTCGC AGAGGTTGAC GCGAACACCT GTTTCGCCTG TCACGCCCGA
CGCACTAGAC TGGTCGATGG CTATCGCCCG GGAAGCGCGT TTCTCGACCA GTTTTCTCCT
GCATTGCTTC GCAGCGATCT CTATTTTCCG GACGGGCAGA TACTGGACGA GGTTTTCGAA
TACGGCTCCT TTCAGCAAAG CAAGATGGCA AAGGCAGGCG TCACCTGTTT CGACTGCCAC
CGGGCGCACG AGGGTACCGT CAAAACTGAG GGCAATGCGT TGTGCACGCA ATGCCACGCC
GAGACTGCCC CGGAACGTTT TGCAGGCAAC AACCCGAGCG GCGCGTTTGA CACCCCGGCA
CACACGCATC ATCCCCAAGG CTCCTCCGGC GCTTTTTGTG CCAATTGTCA CATGCCGGAG
CGCACTTACA TGAAGGTCGA TCCCCGGCGC GATCATTCTT TCGTTATCCC CCGGCCGGAT
CTTTCCGCAC TCTACGGAAC GCCGAATGCC TGCATTTCGT GCCATGCGAG CCAAACGAAC
GCCTGGGCGT CCGAGCACCT GGACGGGTGG TATGGCAAGG CATGGCGTGA ACGCCCGTCG
ATTGCGCATG CATTCGCGCG AGCCGCGCAG AACGATGTTG CCGCGATTGA AAGTCTGCGC
AGATTTCTGA CCGATCGGGA ACAGCCGGGC ATCATCAGGG GCAGCGCGAT CGGTGAAATG
ACAAGGCTCG ATGGAGCAGC CACCGCAGCG GATGTCAGAG TGGCCGCGGG CGATCCCGAT
CCGCTCGTTA GGATGGGGGC GGCGGAGGCG GCTGCCAACT TGTCAGCCGA CTTGCGATTG
GACGCGATCG GCGGCCTGCT TACGGACGAG ACCCGGGCCG TCCGAGTGGC CGCCGCCAGG
GTTCTTGGCG CCACGCCGTC ACTGGACGTT CTCGGCGCGC AACGCCGCGC TTTCGACGCC
GCACTCGGCG ATCTCAGTGC CTATGTAGAG GCTAACGCGG ACGTCGCCGA GACGCAGAGC
AGCTACGGAT CCCTACTTTT CGGCCAAGGG CGAACGGACG AGGCGGAAAA GGCTTTCCGC
CAAGCGATCG TTCTTGATCC GACGCTTTCT GGTGCGCACA TCAATCTTGC CGAGTTCTAT
CGCGCCAGCG GTGACAATGA GCGATCCGAG CAGACTTACG CCGAGGCGGT CGCCGCAAAT
CCGGATCGGG CAGATCTTCG CTACGGACAC GGCTTGTCTC TCGTTCGCCT CAAGGCCATG
CCGGACGCAA TTGAAGAACT AACGGCGGCC CTGCGCTTGG ATCCTGCCAA CTCCCGTTAC
AGGACGACCG CAGCGATCGC GCTTGATTCC GTGGGCCGAA CGGATGATGC GTTCGCTCTG
TTCGGCCCCA CCACCGCCGG TGGTGCAACT GTTGACGCCA CTCTGCTCGG CACTGCCATT
CAGCTCGGAC TTAAACTTGG TCGCTATGCC GACACGCTTA GGTTTGCGGA GGCGCTCGCT
CGTCTTCAGC CGAACGATCC ACAGATTGAA GAACTTGTCA GGCAATTACA GGATGCTGTT
CAACATGGCA GATAA
 
Protein sequence
MKRFVHAATV IRTALLFGAL LAYAAPVGGA ETAPPYAADQ LIAPQKGFVD EKTCTSCHAE 
QAAAFAKSHH AKAMALADDK SVRADFNSVR FERDGVAAEF FRRNGRFFIR TEGPGGKQAD
FEVKYTFAYE PLQQYLVDIG HGRLQAFDIA WDTQKQEWFW LGEGSAAKPG STFHWTGPFY
RWNRTCIDCH STGPQTNFEP QTNKYQSSYV ATSIGCQSCH GGGAKHVEWA RAKAANASAA
AADPGLAEVD ANTCFACHAR RTRLVDGYRP GSAFLDQFSP ALLRSDLYFP DGQILDEVFE
YGSFQQSKMA KAGVTCFDCH RAHEGTVKTE GNALCTQCHA ETAPERFAGN NPSGAFDTPA
HTHHPQGSSG AFCANCHMPE RTYMKVDPRR DHSFVIPRPD LSALYGTPNA CISCHASQTN
AWASEHLDGW YGKAWRERPS IAHAFARAAQ NDVAAIESLR RFLTDREQPG IIRGSAIGEM
TRLDGAATAA DVRVAAGDPD PLVRMGAAEA AANLSADLRL DAIGGLLTDE TRAVRVAAAR
VLGATPSLDV LGAQRRAFDA ALGDLSAYVE ANADVAETQS SYGSLLFGQG RTDEAEKAFR
QAIVLDPTLS GAHINLAEFY RASGDNERSE QTYAEAVAAN PDRADLRYGH GLSLVRLKAM
PDAIEELTAA LRLDPANSRY RTTAAIALDS VGRTDDAFAL FGPTTAGGAT VDATLLGTAI
QLGLKLGRYA DTLRFAEALA RLQPNDPQIE ELVRQLQDAV QHGR