Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_0465 |
Symbol | |
ID | 5321299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 501166 |
End bp | 503100 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640789400 |
Product | putative transmembrane signal peptide protein |
Protein accession | YP_001326157 |
Protein GI | 150395690 |
COG category | [S] Function unknown |
COG ID | [COG4907] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.559326 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.552522 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGCGGC TTTTAGCGGC GCTTGCGCTC GTCTTTTTCG TTCTTGCAAT GCCGCTAGAC GCGGCGGCGG AAGAGTTTAT CTCCGCCTAC CATTCGGTGA TCGGCGTCGC AAAGGACGGC ACGCTGACGG TAACCGAGAC GATTACGGCC AATGTGGAGG GAAACCAGAT AAGGCGCGGC ATCTACCGCG ACTTTCCGTT GACCTTCGTC GATGAGCGCG ACCGCCGCAG CAAGGTCGAC TTCAAACTCC TCTCCGTCGA GCGGGATGGC GATGATGAGG ATTACCGGAC CGAATCGATC AACGGCGGCA TCCGCATCTA CACCGGGAAC GCCGACGTTC TGCTGCCGCA TGGCGAGCAC ACCTTCCAGA TCACCTATGA GACGAGCCGG CAGATACGCT TCTTCGATGA TCACGACGAA CTCTACTGGA ATGTGACGGG AACCGAATGG GCATTTCCGA TCGAGGAGGC CACGGCCACC GTGACACTGC CTGATGGAGT GAAGGCGAAG GCGCTCGACG TCTTTACCGG CGGCTACGGG GCGACGGAAA AAGATGCGCG GGCGGTGGAG GAGGGTGACG AAATCTTCTT CGCGACGACG CGCCGGCTGC GCCCGCAGGA AGGATTGACC GTCGCGATCA AGCTGCCCAA GGGCAGCATC GAGCGCCCCA CTCCTTCGCA GGAAAATATC TGGTGGCTGC GCGACCACGC GGCCCTGGTC ATCGCCGGAG CCGGCCTCCT CTTCGTGACG CTTTATTACG GGCGCGCCTG GATTCGTGTC GGCCGCGACC CGACGCGCGG GGTCATGGTC CCGCGCTGGG ATCCTCCGGA GGGTGTCTCG CCCGCGCTGG TCAACTACAT CGACAACAAG GGTTTTTCCG GCGGGGGATG GACGGCTCTC TCAGCTGCGG CGCTCAACCT TGCGGTGCGG GGACATGTTG TCCTGGAAGA CCTGAAGAAT GCGATCATCA TCACCGCCAC GGGCAAGACC GGTGAAAAGC TGCCGACCGG CGAAGCGGCC TTGATGAGGG CGGTCGAAGC CGCTGACGGC AAGCTCACCA TCGACCGCGA GAATGGGAAG AGGATTCAGG CCGCCGGTTC CGGCTTTCGC AGCGCGATGG AGCGCGAGCA TCGTGGAAAG TATTACCGCG CCAATAAGAG CTACGTCGTC GTCGGCATCG TTCTTTCGGC CGCCACCCTC GCGGCGCTGC TCATCTTCGG CGGCTTGAGC GAGGACAGCA TTCCTTTCGT GATCGTCCCG GTTTTTCTCG CCGTCTTTAT TGCCGCCTTC GCCGTGTCGG TCGGCAAATC GTTCCGGCGC AGCTCGAGCC TCAGACGCCG AATCCTTTCG ATCGTGGTCC TGGCTTTCAT GGGCTTCGTG CTCTTCACCG AATTTTCGAG CATTCTCGCC GCGCTCGTCT TTTCAGCCAG CGACCCAGCC GACCTGCCGT TGTTCTTCGC AATCGGCGGC ATCGTCCTCG TGAACGGGCT GTTCTATTTT CTCATGGGCG CGCCCACACC GCTGGGTACG CGCATGATGG ATGGTATCGA CGGTCTCAGG CAATACCTGA CGCTCGCCGA AAAGGACCGG CTGAACATGC AAAGCGCGCC GGAAATGTCG CCCCGGCATT TCGAAACCCT GCTTCCTTAT GCAGTGGCTC TCGGAGTGGA GAAGCCCTGG AGCGAGACCT TCGAGCGCTG GCTGCTTGCA GCTTCCGCCG GCGCGGCTGC GGCCGCCTAC CAGCCGAGCT GGTATCATGG CGATTCCTTC GGCCCCGGAT CCTTCACCGA CACGATCGGC GGTTTGGCCG GTTCGATGAC GGATAAGATC ACGTCTTCCT TGCCGCCGCC GGCCAGGAGT TCGTCCTCCG GCTTTTCCTC CGGCGGCGGG TTTTCCGGCG GCGGCGGAGG AGGTGGCGGC GGCGGCGGCT GGTGA
|
Protein sequence | MRRLLAALAL VFFVLAMPLD AAAEEFISAY HSVIGVAKDG TLTVTETITA NVEGNQIRRG IYRDFPLTFV DERDRRSKVD FKLLSVERDG DDEDYRTESI NGGIRIYTGN ADVLLPHGEH TFQITYETSR QIRFFDDHDE LYWNVTGTEW AFPIEEATAT VTLPDGVKAK ALDVFTGGYG ATEKDARAVE EGDEIFFATT RRLRPQEGLT VAIKLPKGSI ERPTPSQENI WWLRDHAALV IAGAGLLFVT LYYGRAWIRV GRDPTRGVMV PRWDPPEGVS PALVNYIDNK GFSGGGWTAL SAAALNLAVR GHVVLEDLKN AIIITATGKT GEKLPTGEAA LMRAVEAADG KLTIDRENGK RIQAAGSGFR SAMEREHRGK YYRANKSYVV VGIVLSAATL AALLIFGGLS EDSIPFVIVP VFLAVFIAAF AVSVGKSFRR SSSLRRRILS IVVLAFMGFV LFTEFSSILA ALVFSASDPA DLPLFFAIGG IVLVNGLFYF LMGAPTPLGT RMMDGIDGLR QYLTLAEKDR LNMQSAPEMS PRHFETLLPY AVALGVEKPW SETFERWLLA ASAGAAAAAY QPSWYHGDSF GPGSFTDTIG GLAGSMTDKI TSSLPPPARS SSSGFSSGGG FSGGGGGGGG GGGW
|
| |