Gene Information |
Locus tag | Smed_3524 |
Symbol | |
ID | 5324412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 3727030 |
End bp | 3728475 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640792474 |
Product | filamentous haemagglutinin outer membrane protein |
Protein accession | YP_001329175 |
Protein GI | 150398708 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
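The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes one terminal stop codon; the record does not state either convention explicitly, although its numbers agree with both. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of three and, by default, that the
    final codon is a stop codon that contributes no amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length = gene_length(3727030, 3728475)   # Start bp, End bp from the record
    print(length, protein_length(length))
```

With the values listed for Smed_3524, this gives 1446 bp and 481 aa, matching the Gene Length and Protein Length fields.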
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.838951 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGATC GTCTTTCGGA ACGTGATGAC AGGCCGGGCT TTTGCATTTC CGGCGCGGGC GGCTTGCCTG GAGCAGGCGG CCTTTTCGGA GCGGGCAGAT TGTGCGGCGG CTTGAGCGCA CTCGTTCTCG GAACTGTCTT GTCTGGCGGT TTTGTCGAGT CCGTATCGGG CAATCCGCTC GATGGACACG TCGTTTCGGG TTCGGCTTCG ATCGGGTCGG ACGGTACTGT CATGACTGTG ACGCAGGGAA CGGACCGTGC GGTGATTGAC TGGCGGAGCT TTTCGATCGG GGCGGGAGAG ACCACCCGCT TCGTGCAGCC GTCGGCGTCG TCCGTGACGG CCAACCGGGT GACGGGCGGG GACCCGTCCT CGGTTCTCGG GTCTTTGAAG GCCAATGGCA CGGTTGTTCT CGTGAACCGG AACGGGATCG TGTTCGGGAC GGATGCCCGC GTCGACGCGG GCGGGCTTGT CGCGACGGTC CATGATCTGG ATACGGCGGG GTTCATGTCG GGTTCGGACG TTCTGCGCTT CGAGGGGGGT GCTCGGCCCG GCGCCTCGGT CGTCAACCAC GGGACGATCA GCGTCAGGGA TGCGGGACTT GCGGCCTTCG TTGCTCCGCA TGTGCGCAAT GAGGGGGTGA TCACGGCCGA TTTTGGGCGG GTTGCGTTGG CTTCGGCGAA GGGATTTTCG GTCGATCTTT ACGGCGACGG TCTGCTGTCT TTCGCGCCCG GCGATGGGCT CGAGGAGACG ATTGGCGACG GCGCGGAAGC GCTGGTGCAG AACGGCGGCA CGCTTGCTGG GTCCCGGGTA CTTCTGACGG CGCATGCGGC GCGGGAAGTG GTGAACGCGT CGGTGAATGT GTCCGGTCTC GTGCGTGCGA CGTCGGTGTC GTCACGTGGA GGCGTGATCA CGCTTGGGGG ATCGGGATCT GTGCGCGTCT CCGGGCGCGG GCGGCTTGAT GCGTCCGGCT CGGGGGGCGG CGGCCGTGTG ACGATCAAGG CGGGGCCGTT CTCGACGGAC GGGACGATCG ATGTGCGCGG TGTGGATGCG AGTGGAGCTG CGGTTGCGCG GGGCGGCATA GTGGAGATCA CGGCGGACGG CGTGATGCTT GGCGGCGAGA TCACTGCGTC CGGCGCGTCC GGCGGCGGCG TGGATGTCGC GTCGCGGGGC GTGCTGTCTC TTGCGGGGCG CGTCGAGGCT CAGGGGCTCC TGGGTTCGGG GGGCAGCATC CGCTATCGCG GGCGGCGGGT CGTGGAGACG GGGACGGGGT CGACGAGCGT ATCGGGTCTG ACCCATGGGG GGACGATCAG CGTTATGGCG GACCAGTCGA TCGCCACGTC GGGTTCCTAT GCGGCGGGAG GCGTTTACGG CAAGGGGGGA CGCATCGACA TGAGCGCGCC GGACGTGCGC CTTCTGTCCG CGGGTCTTGA GGCGCGGGGA CGCTAG
|
Protein sequence | MKDRLSERDD RPGFCISGAG GLPGAGGLFG AGRLCGGLSA LVLGTVLSGG FVESVSGNPL DGHVVSGSAS IGSDGTVMTV TQGTDRAVID WRSFSIGAGE TTRFVQPSAS SVTANRVTGG DPSSVLGSLK ANGTVVLVNR NGIVFGTDAR VDAGGLVATV HDLDTAGFMS GSDVLRFEGG ARPGASVVNH GTISVRDAGL AAFVAPHVRN EGVITADFGR VALASAKGFS VDLYGDGLLS FAPGDGLEET IGDGAEALVQ NGGTLAGSRV LLTAHAAREV VNASVNVSGL VRATSVSSRG GVITLGGSGS VRVSGRGRLD ASGSGGGGRV TIKAGPFSTD GTIDVRGVDA SGAAVARGGI VEITADGVML GGEITASGAS GGGVDVASRG VLSLAGRVEA QGLLGSGGSI RYRGRRVVET GTGSTSVSGL THGGTISVMA DQSIATSGSY AAGGVYGKGG RIDMSAPDVR LLSAGLEARG R
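The sequences above are printed in space-separated blocks of ten characters. The following sketch shows one way to strip that formatting, compute GC content, and translate the gene sequence with translation table 11. The helper names are hypothetical. The amino-acid assignments are the ones NCBI lists for table 11, which match the standard code; the alternative start codons that table 11 permits are not handled here, which is harmless for this gene because it starts with ATG.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    A trailing incomplete codon is ignored. Codons containing characters
    other than A, C, G, T are rendered as 'X'.
    """
    seq = clean_sequence(raw)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed in the record.
    print(translate("ATGAAGGATC GTCTTTCGGA ACGTGATGAC"))
```

Applied to the first three blocks of the gene sequence, the sketch yields MKDRLSERDD, which is the first block of the protein sequence shown above. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.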