Gene Smed_1942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1942 
Symbol 
ID5322801 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1994093 
End bp1995973 
Gene Length1881 bp 
Protein Length626 aa 
Translation table11 
GC content62% 
IMG OID640790880 
ProductTPR repeat-containing protein 
Protein accessionYP_001327611 
Protein GI150397144 
COG category[S] Function unknown 
COG ID[COG5616] Predicted integral membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.420919 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGATG AGAGAATACT CCCGAATGCG GCCCTCGCAA AGATTGCAGG CGAATCGGTC 
GAGAACCTGT CAACTCATCG GATTTCGGTC ACACATGGCC AAGAGATGGA CGGCATGAGA
CGGCTGGCGG CGATCTTGGA CGCGGACGTC GTCGGCTATA GCCGGCTGAT GGGACTGGAC
GAAGCCGGCA CTTATCGGGC GGTCAAGCAT TGCCACAACG CCTTCATTCT GCCGTTGGTC
GAGGCCCATA ACGGCAGGAT CGTCAAGCAG GCGGGCGATG GAATGCTCGC AGAGTTCGCA
AGCGTACTCG ACGCCGTCGC TTGCGCCATC GCGATCCAAC GAACAATGCA CGATCAGGCC
GGGAGCGCGG AAACTGAGCG TCTGGAATTG CGCATCGGCG TCCACCTCGG CGATATCGTC
GCGGACGACG GCGATATTCA CGGCGAAGGC ATTGCCGTTG CCGGGCACCT TCAGGAAATG
GCGCCGCCCG GCGGCATCTG CGTGTCGCAG CAAGTCTATG ACCAGGTTTC CTCGAAACTG
GATATCCAGA TGGGAGACCT CGGCTGCAAG ACGTTTGCCG ATATTCCCGG CCCGCTGCAC
GTCTGGTGCT GGCAGCCGGG CGCAACACGG GAAGAATCGC CCGCACCAAA GCAGAACCGG
CCGCGTCCTG ACATGAAGCG GCCGTCGATC GCCGTCCTGC CATTCGTCAA CTTGTCGAGC
GTCGACGAAC AGGAGCATTT CTCCGACGGC TTCACGGAGG AGTTGATTTC CACCCTGGCC
CGATGCCGTT GGCTGCGCGT CGTCGCGCGC AACTCCTCCT TCACTTTCAA GGGGGTAACT
GTCGACGTGA GAAAGGTTGC GTCCGACTTG GGCGTTAAAT ACGTGATCGA GGGCAGCATA
CGCCGCGCGG CAAACCGGAT CCGCATCACG GCGCAATTGC TGAGCGGCGA AACCGGCATG
CTGCTCTGGG CAGAGCGCTA CGACCGCATG CTGGACGACG TTTTCGTGCT GCAGGATGAG
ATCGCGGGAC AGATCACCGG TACTGTGGAA CCCGAACTCG GCTTCATCGA ATTCGCAGCG
CTGCGCGGCC AAAGCGCGAC GGACATGGAT GCCTGGAATA TCTATCTCAA GGGGTTATGG
CACCTCTACA AGTTTGATCT TGAGAATCTG AGGATTTCCA AAGAGCTGTT CGAGCGAGCG
ATCGACCTCG AACCCGCTTT TGCCCAGGCC TATGCGCGCC TTGCTTATGT CCATATACAG
CTCGGCTGGT ATGGCCCTCT TGAGGAGCGG GGCGATCGGA TCGCCGACGC GACGGCGCTG
GCTGAACGTG CGACCGCGCT CGACGACCGT GAGCCGGCAG CGCATCTGGC ACTCGGCCGG
GCACGGGCAC TCGGCGGCCA GCCGGAGCGT GGGATCGAAC ACCTGCGCAA CGCACTGAGG
CTTGTCCCAA GCTTCGCCCA GGGCCACTTT GCTCTCGGGC AAGCGCTTTG TTATGTGGGC
CGCCCCGAAG AGGGCATCAC CGCGATCAAC GAGGCGTTCC GGCTGAGCCC TCGAGATCCG
CATCTGTGGA CGTTTCACAA CATGGTCGCC ATCGCCCAAT ACCAGGCGGG TCGCTTCGCG
CAAGCCGCCG AGGCGGCCCG CGCCTCTCTA CTCAAGGAAA ATGCCACGTT CTGGCCCGCA
ATGGTGCTGG CAGCGTCCCT CGGCGCCCAG GAGCGGAAGG GCGAGGCCCG CGCGGCGGTG
GCGGAGCTTT TGCGCCGGCG GCCGGACATG ACCGCGAAAA CGGCCCGCGC CGAATTCTAC
TTCGGCAGCG TGCCGGCCAT GTCCGAGAAA TTCATCGACC GCTTCGTCAG CGATCTGCAC
CGCGCCGGCG TGCCTGATTG A
 
Protein sequence
MLDERILPNA ALAKIAGESV ENLSTHRISV THGQEMDGMR RLAAILDADV VGYSRLMGLD 
EAGTYRAVKH CHNAFILPLV EAHNGRIVKQ AGDGMLAEFA SVLDAVACAI AIQRTMHDQA
GSAETERLEL RIGVHLGDIV ADDGDIHGEG IAVAGHLQEM APPGGICVSQ QVYDQVSSKL
DIQMGDLGCK TFADIPGPLH VWCWQPGATR EESPAPKQNR PRPDMKRPSI AVLPFVNLSS
VDEQEHFSDG FTEELISTLA RCRWLRVVAR NSSFTFKGVT VDVRKVASDL GVKYVIEGSI
RRAANRIRIT AQLLSGETGM LLWAERYDRM LDDVFVLQDE IAGQITGTVE PELGFIEFAA
LRGQSATDMD AWNIYLKGLW HLYKFDLENL RISKELFERA IDLEPAFAQA YARLAYVHIQ
LGWYGPLEER GDRIADATAL AERATALDDR EPAAHLALGR ARALGGQPER GIEHLRNALR
LVPSFAQGHF ALGQALCYVG RPEEGITAIN EAFRLSPRDP HLWTFHNMVA IAQYQAGRFA
QAAEAARASL LKENATFWPA MVLAASLGAQ ERKGEARAAV AELLRRRPDM TAKTARAEFY
FGSVPAMSEK FIDRFVSDLH RAGVPD