Gene Smed_3524 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3524 
Symbol 
ID5324412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3727030 
End bp3728475 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content67% 
IMG OID640792474 
Productfilamentous haemagglutinin outer membrane protein 
Protein accessionYP_001329175 
Protein GI150398708 
COG category 
COG ID 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.838951 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGATC GTCTTTCGGA ACGTGATGAC AGGCCGGGCT TTTGCATTTC CGGCGCGGGC 
GGCTTGCCTG GAGCAGGCGG CCTTTTCGGA GCGGGCAGAT TGTGCGGCGG CTTGAGCGCA
CTCGTTCTCG GAACTGTCTT GTCTGGCGGT TTTGTCGAGT CCGTATCGGG CAATCCGCTC
GATGGACACG TCGTTTCGGG TTCGGCTTCG ATCGGGTCGG ACGGTACTGT CATGACTGTG
ACGCAGGGAA CGGACCGTGC GGTGATTGAC TGGCGGAGCT TTTCGATCGG GGCGGGAGAG
ACCACCCGCT TCGTGCAGCC GTCGGCGTCG TCCGTGACGG CCAACCGGGT GACGGGCGGG
GACCCGTCCT CGGTTCTCGG GTCTTTGAAG GCCAATGGCA CGGTTGTTCT CGTGAACCGG
AACGGGATCG TGTTCGGGAC GGATGCCCGC GTCGACGCGG GCGGGCTTGT CGCGACGGTC
CATGATCTGG ATACGGCGGG GTTCATGTCG GGTTCGGACG TTCTGCGCTT CGAGGGGGGT
GCTCGGCCCG GCGCCTCGGT CGTCAACCAC GGGACGATCA GCGTCAGGGA TGCGGGACTT
GCGGCCTTCG TTGCTCCGCA TGTGCGCAAT GAGGGGGTGA TCACGGCCGA TTTTGGGCGG
GTTGCGTTGG CTTCGGCGAA GGGATTTTCG GTCGATCTTT ACGGCGACGG TCTGCTGTCT
TTCGCGCCCG GCGATGGGCT CGAGGAGACG ATTGGCGACG GCGCGGAAGC GCTGGTGCAG
AACGGCGGCA CGCTTGCTGG GTCCCGGGTA CTTCTGACGG CGCATGCGGC GCGGGAAGTG
GTGAACGCGT CGGTGAATGT GTCCGGTCTC GTGCGTGCGA CGTCGGTGTC GTCACGTGGA
GGCGTGATCA CGCTTGGGGG ATCGGGATCT GTGCGCGTCT CCGGGCGCGG GCGGCTTGAT
GCGTCCGGCT CGGGGGGCGG CGGCCGTGTG ACGATCAAGG CGGGGCCGTT CTCGACGGAC
GGGACGATCG ATGTGCGCGG TGTGGATGCG AGTGGAGCTG CGGTTGCGCG GGGCGGCATA
GTGGAGATCA CGGCGGACGG CGTGATGCTT GGCGGCGAGA TCACTGCGTC CGGCGCGTCC
GGCGGCGGCG TGGATGTCGC GTCGCGGGGC GTGCTGTCTC TTGCGGGGCG CGTCGAGGCT
CAGGGGCTCC TGGGTTCGGG GGGCAGCATC CGCTATCGCG GGCGGCGGGT CGTGGAGACG
GGGACGGGGT CGACGAGCGT ATCGGGTCTG ACCCATGGGG GGACGATCAG CGTTATGGCG
GACCAGTCGA TCGCCACGTC GGGTTCCTAT GCGGCGGGAG GCGTTTACGG CAAGGGGGGA
CGCATCGACA TGAGCGCGCC GGACGTGCGC CTTCTGTCCG CGGGTCTTGA GGCGCGGGGA
CGCTAG
 
Protein sequence
MKDRLSERDD RPGFCISGAG GLPGAGGLFG AGRLCGGLSA LVLGTVLSGG FVESVSGNPL 
DGHVVSGSAS IGSDGTVMTV TQGTDRAVID WRSFSIGAGE TTRFVQPSAS SVTANRVTGG
DPSSVLGSLK ANGTVVLVNR NGIVFGTDAR VDAGGLVATV HDLDTAGFMS GSDVLRFEGG
ARPGASVVNH GTISVRDAGL AAFVAPHVRN EGVITADFGR VALASAKGFS VDLYGDGLLS
FAPGDGLEET IGDGAEALVQ NGGTLAGSRV LLTAHAAREV VNASVNVSGL VRATSVSSRG
GVITLGGSGS VRVSGRGRLD ASGSGGGGRV TIKAGPFSTD GTIDVRGVDA SGAAVARGGI
VEITADGVML GGEITASGAS GGGVDVASRG VLSLAGRVEA QGLLGSGGSI RYRGRRVVET
GTGSTSVSGL THGGTISVMA DQSIATSGSY AAGGVYGKGG RIDMSAPDVR LLSAGLEARG
R