Gene M446_4184 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_4184 
Symbol 
ID6129930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp4621456 
End bp4622826 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content71% 
IMG OID641644328 
Productflagellar basal body FlaE domain-containing protein 
Protein accessionYP_001770967 
Protein GI170742312 
COG category[N] Cell motility 
COG ID[COG1749] Flagellar hook protein FlgE 
TIGRFAM ID[TIGR03506] fagellar hook-basal body proteins 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGTTT TCAGCGCGAT GCAGACCGCG GTCTCGGGCC TCCAGGCCCA GGCCTTCTCG 
CTCAACAACA TCTCGGGCAA CATCGCGAAT GCCCAGACCA CGGGCTATCG CCGCATCGAC
ACGACCTTCG CCGACCTGCT GGCGGAGCAG CCGAACGACC GGCAGACCAG CGGCTCGGTG
GCGGCCTCGT CCCAGTTCAC CAACACGCTG CAGGGCAACA TCGCCTCGAC CGGCATCACC
ACCAACATGG CGCTCAACGG GGGCGGCTTC TTCGTGGTGC GCACCCCGAG CGCGACGAGC
GGCGGGCAGC CGAGCTTCAC GAGCCAGGAC CTCTACACCC GCCGCGGCGA CTTCGCCGTC
GACAAGGACG GCTACCTCGT GAACGGGGCC GGCTCCTACC TGGTGGGCCA GAGCCTCGAT
CCCCGGACCG GCCAGGCGAC CGGGAACGGG GTCATCCGGA TCGGGAACCA AGCGCTGCCG
GCCCAGGCCA CCACCACCAT CAGCTACGCG GCCAACCTGC CGAGCACGCC CGGCACCACG
GGACCGGCCG TGCTCGGCGC CCTCGCGGGG GGCGACGTGC GGGTGATGTC GGGCACCAGC
GCGAGCCCCG CCACGGTGGC GGCCTCCGAC AGCGGCAGCT TCCTCGCCTC CAGCCTCTCG
GGCGGGTCGC TCACCGCCTA TTCGGCGGCC GGCGGGCCCG TGAACCTGCA GCTGCGCTGG
GCCAAGGTCG CCGCCGCGGA CAGCGCCGCG GGCACCGGCG ACACCTGGAA CCTGTTCTAC
GCCAAGCAGA ACGGCACGGA CTCGACCGCC ACCGCCTGGA CCAATGCGGG GACCGCCTTC
ACGTTCAACG CCAGCGGCCA GCTCACGACG CCGGCCGGCG GTTCGGCCAC GATCCCGAAC
GTCACGGTCG ACGGCGTCCC GCTCGGCACG CTCACGCTGA ACACGGCGTC GGGCGGCCTC
ACCCAGTACG GCGCGGCGGG CGGCCAGGTG ACGACCAATA CCCTGCAGCA GAACGGCTCC
GCCTCCGGCA CGCTCAGGAG CTTGGCGGTG ACCAGCGACG GCCGCGTCAC CGGCACCTAC
TCGAACGGCA CCACCGCCAC CCTCGCGCAG GTCGGCATCG CCCGGTTCAA CGCCCCCGAC
GCGCTCAAGG CCGAGTCGCT CGGCAACTAC GCCCAGACGG TGGAATCGGG CGGCCCGGTC
TCGGGCCTCG CCGGCACCAC CGTCGTGGGC GGCAACGTCG AGCAGTCGAA CACCGACACG
GCCAACGAGT TCTCGAAGCT CATCATCACG CAGCAGGCCT ATTCGGCCAA CACGCGGGTG
ATGTCGACCG CCCAGCAGAT GATGTCCGAC CTCGTCAACA TCATCCGGTG A
 
Protein sequence
MDVFSAMQTA VSGLQAQAFS LNNISGNIAN AQTTGYRRID TTFADLLAEQ PNDRQTSGSV 
AASSQFTNTL QGNIASTGIT TNMALNGGGF FVVRTPSATS GGQPSFTSQD LYTRRGDFAV
DKDGYLVNGA GSYLVGQSLD PRTGQATGNG VIRIGNQALP AQATTTISYA ANLPSTPGTT
GPAVLGALAG GDVRVMSGTS ASPATVAASD SGSFLASSLS GGSLTAYSAA GGPVNLQLRW
AKVAAADSAA GTGDTWNLFY AKQNGTDSTA TAWTNAGTAF TFNASGQLTT PAGGSATIPN
VTVDGVPLGT LTLNTASGGL TQYGAAGGQV TTNTLQQNGS ASGTLRSLAV TSDGRVTGTY
SNGTTATLAQ VGIARFNAPD ALKAESLGNY AQTVESGGPV SGLAGTTVVG GNVEQSNTDT
ANEFSKLIIT QQAYSANTRV MSTAQQMMSD LVNIIR