Gene M446_5137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_5137 
Symbol 
ID6131053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp5645817 
End bp5647202 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content67% 
IMG OID641645272 
Productflagellin domain-containing protein 
Protein accessionYP_001771897 
Protein GI170743242 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0282241 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAGCC TGCTCACCAA CAGCGCCGCG ATGACGGCCC TGACCACCCT CAAGAGCATC 
AACACCCAGC TCGACACGAC CAGCAACCGC GTCTCCACCG GCCAGCGCGT CTCCAACGCT
GCCGACAACG CCGCCTATTG GTCGATCGCC ACCACGATCC GCACGGACAA CAGCTCGCTC
GGCGCGGTGA AGGACTCGCT CGGGCTCGGC GCCTCGACCG TCGACACGGC CTATAACGGC
CTCAACAGCA TCCTCACCGA CCTGCAGAAC ATCCGCGCCA AGCTCCAGAC CGCCACCCAG
GCGGGCGTCG ACCGCGCCAA GGTGCAGACC GAGATCGCGG CGCTCCAGAG CAAGATGAAG
GCGACCGCCA ACTCGTCGGT GTCGAGCGGC CAGAACTGGA TCTCGGTGGA TTCGTCGGCG
AGCGACTACC AGGCCATCCG CAAGATCGTG GCCGGCTTCT CCCGCGGCTC GAACGGCGCG
ATCAACTTCT CGTACGTGAA CGTCGATGTC GGCAGCATCA AGCTGCTCGA CGCCAATGCC
GGCTCGAGCG TGACAGTGGC GGCGACCCAG GCGCAGGCCT CCGGCACCAC CTCGCTGACC
GGCACGACGG CGTTCACCGG CGGCACGGCC GACTTCTCGG CCGCCCAGAC GGTCGAGCTG
ACGATCACCA CCGAGACCGG CAACGCGACC ATCAAGCTCG ACAAGGCCGC GCTGACCACC
GCCGCCAAGG ACCTCACCAA GGTCACCACC AACGAGTTCC TCTCGGCGCT CAACAACGCG
ATCAGCGCCA GCACGCTGAC CACCGCGGGC GTGCCGAGCG TCACCGCCAG CCTGGACTCG
GCGGGCCGCC TCACCTTCAC CCGGAGCGCG ACGGGTGCGA CCAACACCCT CAAGATCGAC
ACGACGGCCA ACAACACCGT CGACATCGGC TTCGGCGCCG CCAGCGTCAC CGGCGTCGTC
AACAAGGGGA CCAACGCGAC CACCACGACC GGCAAGGGCC TGCTCGACAC GGCGACCGGT
ACCTACACGG CCGGCGGCGG CATGTCGGGC TCCTACAGCG TCGCGAATTT CGACATCTCC
AAGCTCGTCG GCACCAACGG CGACACCGAC GTGGCCAACA TCATCACCAT GGTCGATCAG
GTGATCGGCA AGGTTACGGA TGCCGGCACC AAGCTGGGCG CGGCCAAGAC GCAGGTCGAC
GGCCAGAAGA CCTTCGTGGA CACCCTGATG AAGGCCAACA GCGCGACGAT CGGCACGCTG
GTGGATTCGG ACATCGAGGA GGAGTCGACG AAGCTGAAGG CGCTGCAGAC GCAGCAGCAG
CTGGCGGTGC AGTCGCTGAG CATCGCCAAC TCGTCGAGCC AGAACCTGCT CTCGCTGTTC
CGCTGA
 
Protein sequence
MTSLLTNSAA MTALTTLKSI NTQLDTTSNR VSTGQRVSNA ADNAAYWSIA TTIRTDNSSL 
GAVKDSLGLG ASTVDTAYNG LNSILTDLQN IRAKLQTATQ AGVDRAKVQT EIAALQSKMK
ATANSSVSSG QNWISVDSSA SDYQAIRKIV AGFSRGSNGA INFSYVNVDV GSIKLLDANA
GSSVTVAATQ AQASGTTSLT GTTAFTGGTA DFSAAQTVEL TITTETGNAT IKLDKAALTT
AAKDLTKVTT NEFLSALNNA ISASTLTTAG VPSVTASLDS AGRLTFTRSA TGATNTLKID
TTANNTVDIG FGAASVTGVV NKGTNATTTT GKGLLDTATG TYTAGGGMSG SYSVANFDIS
KLVGTNGDTD VANIITMVDQ VIGKVTDAGT KLGAAKTQVD GQKTFVDTLM KANSATIGTL
VDSDIEEEST KLKALQTQQQ LAVQSLSIAN SSSQNLLSLF R