Gene M446_3937 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3937 
Symbol 
ID6130256 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp4385627 
End bp4387111 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content67% 
IMG OID641644095 
Productflagellin domain-containing protein 
Protein accessionYP_001770737 
Protein GI170742082 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0911529 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00880768 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
TTGACGAGCC TGCTGACCAA CTCCGCCGCG ATGACGGCGT TGACCACGCT GAAGTTGATC 
AACGCGAACC TCGACACCAC GAGCAACCGG GTGTCGACGG GCCAGCGCGT CTCGGCCGCG
GCCGACAACG CCGCCTACTG GTCGATCGCG ACCGCGGTGC GCTCCGACAA CGCCTCGCTC
GGCGCCGTGA AGGACTCGCT CGGCCTGGGG GCCTCCGCGG TGGGGACGGC CTATAACGGC
ATCAATAGCA TCATCTCGGA CCTGCAGAAC ATCCGGGCCA AGCTGCAGAC CGCCCTCCAG
GGCGGGACCG ATCGCAGCAA GGTGCAGACC GAGATCTCCG CCATCCAGAA CAAGATGAAG
GCCACGGCCG ACTCGTCCGT GTCGAACGGA GTGAACTGGC TGTCGGTGGA TTCCTCGGCC
ACGAACGCGC GGTTTCGTCC GGTGGAGAGC GTGGTGGCGG GCTTCGCGCG CAATGCCGCC
GGCACCGTCT CGTTCTCGAC CATCGACGTC AACGTGAACG CGATCAAGCT CTACGACGTC
AACGCGACCA GCATCACCTC GGCGGCGACC CAGGCGCAGT TCACGGCAGG CCAGTCGCTC
ACCGGGACGC CGCTCTTCAC GAACGGGACG GCCGACTTCT CGGGCACGAA CGAGGTCAAC
TTCACCCTCC AGATCGACCG GCTCGGCACC GCGGGTGGGG CGGCCGGAAC GGCCGCGGGC
GCCTATGGCG GCAAGGTGAA CATCGTCCTG AACAATTCGA CGCTGATAAC GGCGGCAAAC
GATCGCTCGA AGGTCACGAC GGACGAATTC CTGAGAGCCA TCAACAACGT CATCGGCGCG
AGCACGTTGC CCCAGACGGG AGCCGGCGGC TCCGCGGTGG CGATCACGAC GGGCGGGCTG
AAGGGCCTGA TCACCGCCGC CCTCGATTCC TCGGGCCGGC TGGTCTTCCG CACCACGGAT
ACCGGCGCGA CCCTGACCGC CACCCTGACC GTCGGGACCG CCACGGCGGG CAACACGCTG
AAGGATTTCG GCTTCGGCAC GACGGCGGGG CTCGCGGCCA CCGGCAAGGG CACGGATGCG
GGAACCACCA CGGCCCGCGG CATCATCGAC ACGAGCGTCG GCAGCTACGA TGCCTCCTTC
GGCGGCGGCA GCTACTCGAT CGCCAATTTC GACATTTCGA AGCTGGTCGG GACGGCCGGC
GACAGCAACC TCAAGGACAT TATCGCGGCC GTCGACAAGG CCCTGGCGGC GGTCACCGAT
GCCGGCACCA AGCTCGGCGC GGGCAAGAAC CAGATCGAAG GCCAGACGAG CTTCGTCGAC
TCGCTCATGA AGGCGAACAC CGCCACGATC GGCACCCTGG TCGACGCCGA CATCGAGGAG
GAATCGACGA AGCTGAAGGC GCTGCAGACG CAGCAGCAAC TCGCCGTCCA GGCGCTCAGC
ATCGCGAATT CCTCAGGGCA AGCCCTGCTC ACCCTGTTCC GCTAA
 
Protein sequence
MTSLLTNSAA MTALTTLKLI NANLDTTSNR VSTGQRVSAA ADNAAYWSIA TAVRSDNASL 
GAVKDSLGLG ASAVGTAYNG INSIISDLQN IRAKLQTALQ GGTDRSKVQT EISAIQNKMK
ATADSSVSNG VNWLSVDSSA TNARFRPVES VVAGFARNAA GTVSFSTIDV NVNAIKLYDV
NATSITSAAT QAQFTAGQSL TGTPLFTNGT ADFSGTNEVN FTLQIDRLGT AGGAAGTAAG
AYGGKVNIVL NNSTLITAAN DRSKVTTDEF LRAINNVIGA STLPQTGAGG SAVAITTGGL
KGLITAALDS SGRLVFRTTD TGATLTATLT VGTATAGNTL KDFGFGTTAG LAATGKGTDA
GTTTARGIID TSVGSYDASF GGGSYSIANF DISKLVGTAG DSNLKDIIAA VDKALAAVTD
AGTKLGAGKN QIEGQTSFVD SLMKANTATI GTLVDADIEE ESTKLKALQT QQQLAVQALS
IANSSGQALL TLFR