Gene Smal_1895 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmal_1895 
Symbol 
ID6475920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStenotrophomonas maltophilia R551-3 
KingdomBacteria 
Replicon accessionNC_011071 
Strand
Start bp2132951 
End bp2134168 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content67% 
IMG OID642731077 
Productflagellin 
Protein accessionYP_002028282 
Protein GI194365672 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.226617 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0800037 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACAGG TAATCAACAC CAATACGATG TCGCTCAACG CTCAGCGCAA CCTGAGCACC 
AGCGGCAGCT CGCTCGCTAC CACCATCCAG CGCCTCTCGT CCGGCCTGCG CATCAACAGC
GCGAAGGACG ATGCGGCCGG CCTGGCCATC AGCGAGCGCT TCACCACCCA GATCCGTGGT
CTGGACGTGG CCATCCGCAA CGCCAACGAC GGCATCTCGC TGGCCCAGGT CGCCGAAGGT
TCGCTGAGTG AAGTCGGCAA CAACCTGCAG CGCATCCGTG AACTGGCGGT GCAGTCCTCC
AACGCCACCA ACTCCTCCAG CGACCGCAAG GCGCTGCAGG CCGAAGTGAC CCAGCTGGTC
TCGGAAGTGG ACCGCGTCGC CAAGCAGGCC GATTTCAACG GCACCAAGCT GCTGGACGGC
TCCTTCACCA GCCAGCTGTT CCAGGTGGGT GCCAACGCCG GCCAGGCCAT CGCCATCAAC
AGCGTGGTCA ACGCCAAGGC CGATTCGCTG GGCGCCGCCT CCTTCGCCAA CGGCTACACC
GGCACCGCGC TGGCGACCGA CAAGGCCACC GCCGACACCA CCTACTCCGG CCTGCAGATC
TCGGTAACCC CGCCGGGCGG CACCGCCACC ACGATCACCG TCAACGACTT CACCGTGAAG
GCGGGCGAGT CGATCACCTC GGCCACCTCC GCTGCGATCA ACAACAAGCT GGGCGAAACC
GGCGTCATGG CCTCGGTCAA TGCCGGCGTG ATCAGCCTGG CTTCGGTCAA GGACGGCCAG
ACCTTCACCC TCGGCGTCAG CGCCGCGACC CCGCCGACCG GCGCCACCGC CGCGACCATC
GCCGGCCTCG GCCTGACCGA GACCAGCACC AATGGCGGCA CGCTCACCGG CACCTCGGCC
AGCTTCGTCA AGGACCTGAA CGTCACCACC GCCGAAGGCG CGCAGAAGGC GCTGTCGATC
GTCGACAAGG CCCTGGAATC GGTCAACAGC GTCCGCGCCG ACCTCGGTGC GATCCAGAAC
CGCTTCACCT CGGTGGTGGC CAACCTGCAG ACCTCCTCGG AGAACCTGTC GGCCTCGCGC
AGCCGCATCC GCGATACCGA CTTCGCCAAG GAAACCGCCG AACTGACCCG CACCCAGATC
CTGCAGCAGG CCGGTACGGC CATGCTGGCC CAGGCCAACC AGGTACCGCA GAACGTGCTC
AGCCTGTTGC AGCGCTGA
 
Protein sequence
MAQVINTNTM SLNAQRNLST SGSSLATTIQ RLSSGLRINS AKDDAAGLAI SERFTTQIRG 
LDVAIRNAND GISLAQVAEG SLSEVGNNLQ RIRELAVQSS NATNSSSDRK ALQAEVTQLV
SEVDRVAKQA DFNGTKLLDG SFTSQLFQVG ANAGQAIAIN SVVNAKADSL GAASFANGYT
GTALATDKAT ADTTYSGLQI SVTPPGGTAT TITVNDFTVK AGESITSATS AAINNKLGET
GVMASVNAGV ISLASVKDGQ TFTLGVSAAT PPTGATAATI AGLGLTETST NGGTLTGTSA
SFVKDLNVTT AEGAQKALSI VDKALESVNS VRADLGAIQN RFTSVVANLQ TSSENLSASR
SRIRDTDFAK ETAELTRTQI LQQAGTAMLA QANQVPQNVL SLLQR