Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5971 |
Symbol | |
ID | 5320273 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | + |
Start bp | 929726 |
End bp | 931201 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640777652 |
Product | hypothetical protein |
Protein accession | YP_001314584 |
Protein GI | 150377989 |
COG category | [S] Function unknown |
COG ID | [COG5361] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.346043 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAATCA CTCGGAGGGA TGCGTTCAAA ATGGCTTCGT CTGCTGCAAT TCTTGGAGCT AGCGCGGCGG CAGCACACGA TGCTGCCGCC AAAGCCGTCC CGCAGGGTGT CGATCTGGAG TTCGACCTCG GAATTCCTAC GCAAGATACG GTCGAGAAGC TTTACGACAC GATGGATTTT CAACGCGCAG TGCAGGGCTA TCTCTGGGCG GTTCCGATCG TCGGAATGGA AGGTGCGCGC CGGATGCTTG TCGACAACGC CGAAGCCAGG AGCGGCGATC TTGTGCTTGT TGCCGGGTAT AGGGACGTCA GTGCCATGCT CGGGTCGAAT GTGACGACGC CCTATGTGTT CGCGTGGTTT GATCTTACAG AAGGACCGAT TGTCATCGAA TATCCCGAAG GCGCCACGGC CGGCTCCCTG ATCGACTGGT GGGATCGTCC GCTCATTGAT GTCGGCGTTT CGGGCCCAGA TGGCGGCAAG GGAGCGAAAT TCGTAGTGGT TGGTCCGGCG CATGAAGCGC CGGAAAATTC CCCGGCTGGC GCAAAACTGC TGCGTTCCCG CACCAACAAA GTCCTGTTGT TCTGCCGGGG ACTCGATGGT GACCTCAAGA CGGTCGAGGC TGTTTTCTCT AACACTCAGG TCTATCCGCT TGGCGCTACA GGGAGTGGAG TGGCCGCGTT TCTTAGATTC AAGACGGAGG GCGAGTTGAC CAGCATGGCT CATCCTAAGG GCCTCGCATA CTGGCAGTCG TTGATTCAGG CGCTGGATGG TGAGCAGATC GAGGATCGAG ACCGCTTCTT TGCCGCTATG TTGAAGCCGC TCGGCGTCAC CTATGGCGGA TCGTTCTCGC CAAACGACCG GCAGACGGGG TTACTTCACA ACGCCGCAAT CCTCGGCGAA GCGATGGCGA AGGCCAGTGC TTTCAGCAAG CGCATTCCAG GGATGCGCTA TAGGGACGAT ACGCACTGGG AATATTTGAT CCCTCAGGAC TTTGTCAACG AACAGGACGG ACCGGATGGT ACTCTGCTCG ATCAACGGAC GGCCTTCTTC TACGAGGTCA CAGGCACTTC CGCCGCCGTT CTCACCAAGA CACCCGGAAC TGGCTCGGCA TACCTCACCG CCTACAGTGA TCCTGACGGA CACGCTTTCG ACGGCGCAAA GTCTTACCGG TTGCGCGTTC CAGCCAATGT ACCCGCCAAG ACCTTTTGGT CGATCACGCT CTACGACACC GAGACGCGCG GTCTCATTCA GAACAAGCAA CAGATCGTGG ATCGGTCCTC ACGGCAAAAT CTCAAGGTCC AAAACGACGG CTCGATTGAG ATCGTTATGG GACCGCAGAC TCCGGATGGC CTGGAGCAGA ACTGGATACC GACAACGCCA GGTAAGGCTT GGTTTGTGTA TTTCCGCTTG TTCGGTCCGC TAGAGCCATA TTTCGACAAG TCGTGGCGCT TGCCTGACAT TGAGAAGGCC ATATAA
|
Protein sequence | MEITRRDAFK MASSAAILGA SAAAAHDAAA KAVPQGVDLE FDLGIPTQDT VEKLYDTMDF QRAVQGYLWA VPIVGMEGAR RMLVDNAEAR SGDLVLVAGY RDVSAMLGSN VTTPYVFAWF DLTEGPIVIE YPEGATAGSL IDWWDRPLID VGVSGPDGGK GAKFVVVGPA HEAPENSPAG AKLLRSRTNK VLLFCRGLDG DLKTVEAVFS NTQVYPLGAT GSGVAAFLRF KTEGELTSMA HPKGLAYWQS LIQALDGEQI EDRDRFFAAM LKPLGVTYGG SFSPNDRQTG LLHNAAILGE AMAKASAFSK RIPGMRYRDD THWEYLIPQD FVNEQDGPDG TLLDQRTAFF YEVTGTSAAV LTKTPGTGSA YLTAYSDPDG HAFDGAKSYR LRVPANVPAK TFWSITLYDT ETRGLIQNKQ QIVDRSSRQN LKVQNDGSIE IVMGPQTPDG LEQNWIPTTP GKAWFVYFRL FGPLEPYFDK SWRLPDIEKA I
|
| |