Gene Smed_0457 details


Gene Information

Locus tag: Smed_0457
Symbol:
ID: 5321291
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 493538
End bp: 495361
Gene Length: 1824 bp
Protein Length: 607 aa
Translation table: 11
GC content: 61%
IMG OID: 640789392
Product: TPR repeat-containing protein
Protein accession: YP_001326149
Protein GI: 150395682
COG category: [R] General function prediction only
COG ID: [COG4785] Lipoprotein NlpI, contains TPR repeats
TIGRFAM ID:
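
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(493538, 495361)
print(length)                  # 1824
print(protein_length(length))  # 607
```

Both values agree with the Gene Length and Protein Length fields of the record.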


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.559359
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGCAGA ATACATTCCT TCGCCTCCTC AGCGGCGCGG CAATGCTGGT TCTTGCGACG 
GTGGCCGGCA ACCTGCAGGC CTTTGCCGAG GAGAAGGCGC CGGTCGAAGA CGCCGAACCT
TTCGACATCA AGAGCGTCAA CACCTTTGCC GGTGCCTTCC TTGCGGCGCG CACGGCCGAC
GTGGACCGCG ACTTCGCGGC GGCAACCGAC CTATATCGCA CTGCACTGAG GTTCGAGCCG
GGCAACACCG AGGTGAAGCA GCGGCTGATG ATCACGCTGT TGATGAGCGG CAGGTTCGAT
GAAGGCGCCA AGATCGCCGA AGAGCTGAAA TCGGATCCTG CCGTTGAAAG GATCACCACG
GTGGTCCGAG CGATCGAAGC GATCCGTAAG CGCGAATACC GCAATGCGCA AAAGCTCTTG
AAGTACGAGG GGCCGAACGA CCTCGATCGG CTGATGAGCT CGCTGCTTTC CGCTTGGGCG
AAGTTCGGTC AGGGCCGACC GAAAGAGGCC CTGGCGCAGA TCAAAAATCT CCAGGGTCCG
GAATGGTTCC GGATCTTCAA GAACTATCAT GCCGGAGCGA TCGCGCTTGC GGCCGGCGAC
AAGGCAACCG CGCGGACCCG GCTGAACGAC GCCGTTCTCG ACCGCGAAGG CGGCGGCGCG
GCACCGGATA CCTTCATGCG GGCGGTGGAG GCGCTGGCGC GGTTCGAGGC ACGCGAAGGC
AACAAGCAGA AGGCGTTGGA TACGATCGCC GTCGGCGAAA ATCTCGTAAA CAACTACACC
CCGCTCGAGG CTCTCAGAAA GAGTGTCGAA GAAGGAAAGA CGCAGGAACA GCAGGTTCGC
AACGCCGTGC AGGGTGCCGC CGCCGTGCTC TTTTCCATCG GTGGTGCGCT CAACCGGGAG
GGGGCGGAAG ACATCGTTTC GCTTTACCTC CAAACGGCGC GGCGGCTTGA CCCGGAAAGC
GCCGATATTC TGGTGATGCT CGGCGGCATC GCCGAAAATC TGAAGAAGCC GGACGAGGCG
ATCGAGCTTT ACAAGAGCGT GCCGGAAAAC TCGCCGATGC GTCGCCTTTC CGAGCTGCAG
CTTGGCCTGA GTCTCGCGGG CATCGGAAAG GTCGAAGAGG CGAAGAAGCA CCTGAAAGGG
CTGATCGACG TCGATCCCAA GAATATCCGC AATTATCTCG CCTATGGCAG CGTGCTTTCC
GACGCCAAGG ACTACAAGGC CATGGGCGAG CTTTACGATC GGGCGGTCGA GGCGATCGGT
CCCGTTCCAA AGCGCAGCGA CTGGACCATC TTCTTCCAGC GCGGCATCGC CTATGAGCGG
CAGAAGCTTT GGGAGAAGGC CGAGCCGAGC TTCCTCAAGG CGCTGGAGCT CAACCCGGAT
CAGCCCCAGG TGCTCAACTA TCTCGGCTAC TCCTGGGTCG ATATGAACGT AAAGCTCGAA
GAGGGCCTCG ACATGATCCG CAAGGCGGTC GAACTCAAGC CCGACGACGG CTATATCGTC
GACTCGCTCG GCTGGGCGTA TTTCCGCATG GGCCGTTTCG ATGAGGCCGT GGCCGAACTC
GAGCGCGCGG CCGAGCTGAT GGCCGGCGAC GCGACGATCA ACGATCACCT TGGCGACGCC
TACTGGCGCG TCGGCCGGAA ACTCGAAGCG GTGTTCCAGT GGAATCAGGC GCTGGAGCTG
AAGCCTGAGG AGGCTGAAAT TCCCAAGATA AAGGCGAAGA TCGAAAATGG TCTGCCGCCG
TTGAAAGAGT CGGTTCCGGC CGCGGCGGAT GCCAAGGAAA AGCTTCCGAA AAAAACGGAG
CCGGCGCCGG ACAAGAAGTC CTGA
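
The gene sequence is displayed in blocks of 10 bases, 60 bases per row, so the spacing has to be stripped before any computation. The sketch below shows one way to do that and to compute a GC percentage. It assumes the sequence has been pasted into a Python string; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first displayed row is used as input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence as displayed (six blocks of 10 bases).
first_row = "ATGCGGCAGA ATACATTCCT TCGCCTCCTC AGCGGCGCGG CAATGCTGGT TCTTGCGACG"
seq = clean_sequence(first_row)
print(len(seq))         # 60
print(gc_percent(seq))  # GC percentage of this row only, not of the whole gene
```

Passing the full 1824-base sequence to `gc_percent` would give a value that can be compared with the GC content field of the record.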
 
Protein sequence
MRQNTFLRLL SGAAMLVLAT VAGNLQAFAE EKAPVEDAEP FDIKSVNTFA GAFLAARTAD 
VDRDFAAATD LYRTALRFEP GNTEVKQRLM ITLLMSGRFD EGAKIAEELK SDPAVERITT
VVRAIEAIRK REYRNAQKLL KYEGPNDLDR LMSSLLSAWA KFGQGRPKEA LAQIKNLQGP
EWFRIFKNYH AGAIALAAGD KATARTRLND AVLDREGGGA APDTFMRAVE ALARFEAREG
NKQKALDTIA VGENLVNNYT PLEALRKSVE EGKTQEQQVR NAVQGAAAVL FSIGGALNRE
GAEDIVSLYL QTARRLDPES ADILVMLGGI AENLKKPDEA IELYKSVPEN SPMRRLSELQ
LGLSLAGIGK VEEAKKHLKG LIDVDPKNIR NYLAYGSVLS DAKDYKAMGE LYDRAVEAIG
PVPKRSDWTI FFQRGIAYER QKLWEKAEPS FLKALELNPD QPQVLNYLGY SWVDMNVKLE
EGLDMIRKAV ELKPDDGYIV DSLGWAYFRM GRFDEAVAEL ERAAELMAGD ATINDHLGDA
YWRVGRKLEA VFQWNQALEL KPEEAEIPKI KAKIENGLPP LKESVPAAAD AKEKLPKKTE
PAPDKKS
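
The protein sequence corresponds to the gene sequence translated under translation table 11. The sketch below illustrates this on the first and last rows of the gene sequence. It builds the codon table from the compact 64-letter amino acid string of the standard genetic code, whose codon-to-amino-acid assignments table 11 shares. Alternative start codons are ignored, which is a simplifying assumption that does not matter here because the gene begins with ATG. `translate` is a hypothetical helper name.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, matched to the compact amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCGGCAGA ATACATTCCT TCGCCTCCTC"))  # MRQNTFLRLL
print(translate("CCGGCGCCGG ACAAGAAGTC CTGA"))        # PAPDKKS
```

The two outputs match the first ten and the last seven residues of the protein sequence shown above.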