Gene Smed_1294 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1294 
Symbol 
ID5322141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1390601 
End bp1391641 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content61% 
IMG OID640790235 
Productrare lipoprotein A 
Protein accessionYP_001326979 
Protein GI150396512 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0797] Lipoproteins 
TIGRFAM ID[TIGR00413] rare lipoprotein A 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.347531 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATCCG ATAATGGTGC GGCATATCTT GTGAGGGGGC TCCGTCTCGC AGCGGTTCCG 
TTGCTTTGCG CGGTGCTTGC GTCCTGCGGA TCGACGTCGA GCGTCAAGAA AACCAAGCCG
CGGAGCAAGG AATATTTCGC CGAGTCGGTC TACGGCGTCA AAGCCAGCCC CAGAGTAGCC
ACCGGGAATA ACATTCCCAA GGGTGGCGGA CGCTATCAGG TCGGCAAACC TTACCAGGTC
AAGGGCAGGT GGTATAAGCC GAAAGAGGAT TTCGGCTATA ACAAAAGCGG TATGGCCTCC
TGGTACGGTT CGGCCTTCCA CGGCCGGCTG ACGGCGAACG GGGAGGTCTA TGACACAAAT
CACCTTTCGG CCGCCCACCC GACATTTCCG CTGCCGAGCT ATGCGCGCGT GACCAATACG
GAAAACGGCA CGTCGGTCGT CGTGCGTGTC AACGATCGCG GCCCCTATGA ATACGGCCGG
ATCATCGACG TCTCTTCGAA GACCGCCGAC CTGCTCGATA TCAAGCGTAA GGGCAGCGCC
AAGGTTCGCG TGCAGTATAT CGGCCGCGCG CCACTCGAAG GCAACGACAT GCCTTACCTC
ATGGCTTCCT ACGTCAGGAA GGGCGAACGC GGTCCGAGCG TGATGCCGGA AGGACAGATC
GCAACAGGCG TGATGGTCGC CTCCAACAAG CCGCTACGGG AGCTGATTCC CGATGTCGGC
GCGGTACCCG TTCCGAATAG ATCCGCGCTC GAGCCCGGTG CGCCGATGAA CGCGCTTGCC
GAACCGGCAA AAGCCACTGC AGTTTCAGGT GCATTCGACG AGTTTGCAAT CCTGCCGGAG
ATCGGTCCCG TGCCCAGGGC GCGTCCTCAG TTGGTCCCGC TGCCGGACGG CAGCCTGACC
TATGCCGCGG CCTATGTCGA AGTCCGCGTC AGCGACGAGC CTTCGCCATT CGAGGCGATC
ATGGTCGAGC GCAATCCGCT GACACCCGAA TCCATTCTTG CCTATGCGAA GCGCCGGCAC
CAAAACGCCA CCGCGCGCTG A
 
Protein sequence
MTSDNGAAYL VRGLRLAAVP LLCAVLASCG STSSVKKTKP RSKEYFAESV YGVKASPRVA 
TGNNIPKGGG RYQVGKPYQV KGRWYKPKED FGYNKSGMAS WYGSAFHGRL TANGEVYDTN
HLSAAHPTFP LPSYARVTNT ENGTSVVVRV NDRGPYEYGR IIDVSSKTAD LLDIKRKGSA
KVRVQYIGRA PLEGNDMPYL MASYVRKGER GPSVMPEGQI ATGVMVASNK PLRELIPDVG
AVPVPNRSAL EPGAPMNALA EPAKATAVSG AFDEFAILPE IGPVPRARPQ LVPLPDGSLT
YAAAYVEVRV SDEPSPFEAI MVERNPLTPE SILAYAKRRH QNATAR