Gene Smed_0266 details

Gene Information

Locus tag: Smed_0266
Symbol:
ID: 5321098
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 286024
End bp: 287211
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 61%
IMG OID: 640789201
Product: flagellin domain-containing protein
Protein accession: YP_001325960
Protein GI: 150395493
COG category: [N] Cell motility
COG ID: [COG1344] Flagellin and related hook-associated proteins
TIGRFAM ID:
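
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 286024
end_bp = 287211
gene_length_bp = 1188
protein_length_aa = 395

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so it adds no residue.
codons = span // 3
residues = codons - 1

print(span == gene_length_bp)         # span agrees with the Gene Length field
print(span % 3 == 0)                  # length is a whole number of codons
print(residues == protein_length_aa)  # agrees with the Protein Length field
```

Under these assumptions all three checks agree with the record: 1188 bp is 396 codons, which gives 395 residues plus one stop codon.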


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAGCA TTCTCACCAA CATTGCGGCT ATGGCCGCTC TCCAGACTCT GCGCACGATC 
GGCTCCAACA TGGAAGAGAC GCAGGCGCAT GTCTCCTCCG GTCTGCGCGT CGGCCAAGCC
GCCGACAACG CCGCCTATTG GTCGATCGCA ACGACCATGC GCTCCGACAA TATGGCGCTT
TCCGCCGTTC AGGACGCCCT CGGCCTCGGC GCCGCCAAGG TTGATACTGC CTATTCCGGT
ATGGAATCGG CCATCGAAGT CGTTAAGGAA ATCAAGAAGA AACTCGTTGC CGCTACTGAA
GACGGTGTTG ACAAGGCCAA GATTCAGGAA GAAATCGATC AGCTCAAGGA TCAGCTCACG
AGCATTTCCG AGGCGGCGTC GTTCTCCGGT GAAAACTGGC TTCAGGCGGA CCTCAGCGGC
GGCGCAGTCA TCAAGAGCGT CGTCGGATCG TTCGTCCGGG ATGCGAGCGG TGCCGTGTCG
GTCAAGAAGG TCGACTACAG CCTCAACACC AACTCGGTTC TCTTCGATAC TGTCGGCGAC
ACCGGCATCC TGGACAAGGT CTACGACGTC TCGCAGGCAA GCGTTACGCT GACGATCAAC
ACCAACGGTG TCGCGTCGCA GCATACGGTC GCTGCCTATT CGCTGGAGTC TCTCACCGAA
GCTGGTGCGG AGTTCCAGGG CAACTACGCC CTCCAGGGCG GTAACAGCTA CGTCAAGGTC
GAGAACGTCT GGGTTCGCGC CGAGACCGCT GCAGCCGGCG CCACCGGCCA GGAGCTTGCC
GCCACCACAA CGGCAGCCGG CACCATTACC GCGGACAGCT GGGTCGTCGA CGTTGACAAC
GCACCTGCCG TCAGCGTTTC GGCCGGTCAG TCCGTTGCCG GGATCAACAT CGTCGGAATG
GGTGCAGCCG CACTCGATGC GCTGATCAGC GGTGTCGATG CTGCTCTGAC CGACATGACG
AGCGCAGCAG CCGACCTCGG CTCGATCGCC ATGCGCATCG ACCTGCAGAG CGACTTCGTC
AACAAGCTCT CGGACTCGAT CGACTCGGGC GTTGGCCGTC TCGTCGATGC GGACATGAAC
GAGGAATCGA CCCGCCTGAA GGCTCTTCAG ACCCAGCAGC AGCTTGCTAT CCAGTCGCTG
TCGATCGCGA ACTCGGCCTC GGAAAGCGTC CTCACGCTCT TCCGCTAA
 
Protein sequence
MTSILTNIAA MAALQTLRTI GSNMEETQAH VSSGLRVGQA ADNAAYWSIA TTMRSDNMAL 
SAVQDALGLG AAKVDTAYSG MESAIEVVKE IKKKLVAATE DGVDKAKIQE EIDQLKDQLT
SISEAASFSG ENWLQADLSG GAVIKSVVGS FVRDASGAVS VKKVDYSLNT NSVLFDTVGD
TGILDKVYDV SQASVTLTIN TNGVASQHTV AAYSLESLTE AGAEFQGNYA LQGGNSYVKV
ENVWVRAETA AAGATGQELA ATTTAAGTIT ADSWVVDVDN APAVSVSAGQ SVAGINIVGM
GAAALDALIS GVDAALTDMT SAAADLGSIA MRIDLQSDFV NKLSDSIDSG VGRLVDADMN
EESTRLKALQ TQQQLAIQSL SIANSASESV LTLFR
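
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the codon-to-amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (table 11 differs in the start codons it permits). The function name `translate` and the table construction are illustrative; this is not the tool that produced the record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, listed in TCAG order (first base varies slowest).
# Assumption: table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks of 10.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGACGAGCA TTCTCACCAA CATTGCGGCT"))  # MTSILTNIAA
# Last 12 bases of the gene sequence above, ending in the TAA stop codon.
print(translate("CTCTTCCGCTAA"))  # LFR
```

The two fragments reproduce the first ten and last three residues of the protein sequence listed above.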