Gene M446_5135 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_5135 
Symbol 
ID6131051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp5640661 
End bp5642433 
Gene Length1773 bp 
Protein Length590 aa 
Translation table11 
GC content67% 
IMG OID641645270 
Productflagellin domain-containing protein 
Protein accessionYP_001771895 
Protein GI170743240 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.129659 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAGCC TGCTCACCAA CTCCGCGGCG ATGACTGCGC TCACCACCCT CAAGTCGATC 
AATGCCCAGC TCGACATGAC CAGCAACCGG GTCTCGACCG GGCAGCGCGT CTCCGCCGCC
TCCGACAACG CCGCCTACTG GTCCATCGCC ACCACGGTGC GCACCGACAA CGCCTCCCTG
GGCGCGGTGA AGGACTCGCT CGGCCTCTCC TCCTCGGCCG TCGGCACCGC CTATAACGGC
CTCAACAGCA TCCTCTCCGA CCTGCAGAAC CTGCGCGCCA AGCTCCAGAC CGCCACCAAT
TCCGGCACCG ACCGCTCCAA GGTCCAGACC GAGATCGCGA CCCTGCAGAG CAAGATGAAG
GCGACCGCCG ATTCCTCGGT CTCCAGCGGG CAGAACTGGC TCTCCGTCAA CTCGGCTGCC
ACGAACACCA GCTACAAAGC CGTCCAGAAC GTGGTGGCGG GCTTCTCGCG CGCCAGCGAC
GGCACGATCA ACTTCTCGAT GATCAACATC AACGTCGGCC AGATCAAGCT CTACGACACC
AACTCGACCA GCGTCACGGC GGCCGCGACC AACGCCCAGG TCGTCGGCTC GACCTCCCTC
ACCGGCACGA GCGGGTTCGG CTCCGGGGCC GGCACGGCGG ATTTCAGCGC CATCAACGAG
GTCTCGCTGG CGATCTCCGT GGGCGGCCAG ACGGGCAACA TCGTGCTCAA CTCGTCCACG
ATGGCGACGG CTGCCAAGGA CCTGTCCAAG GTCACGACCG ACGAGTTCCT GAAGGCCATC
AACAACCAGA TCTCCGCCTC CGCGACCCTG GCCGGCAAGG TGACGGCGGG CCTGGATTCC
TCGGGCCGGC TGACCTTCAA CTCGACCGCG ACCGGCGCGG CCACCACCCT GTCGGTCTCC
GTCGGCAACC CCTCGGCCGG CAAGGCCCTG ATCGACGTCG GCTTCGGCAC CACCACGGGC
GTCGCCACGG TGGCGATGGT GCAGGGCAGC GCCTTCACCA ACCTCGACCT CTCGACGGGC
AGCAAGGCCA TCACGATCAG CGACGGCACG ACCTCGAAGA CCATCACGCT GGACGCGGCG
GCCTATGCCG CCCTGGCCAC CAAGACCTCC GCCGGCAACA ATGCCGTCAA CGCCACCGAC
GTGGCGGCGA TGATCAACAC CACGCTCTCG GCCACGACCG CTGCCAAGGT CGCGGCGTCG
GTCCAGGGCG GGAAGCTCGT CCTGACCAGC ACCGACACGG CCGGTGCGGG CTCGAAGATC
ATCATCAGCA ACGCGGACGC GGCGTTCGGC TTCACGGCGG GCACCACCAC TGGCACGGAC
GTCCCGGCAG GCGGTGTTGT GCCGCGCACC GGGACCGGCA CCGACGCCGG GGTCACCCAG
GCGCAGGGCA TCCTGGACAC CGCGATGGGC ACCTACAACG CGCAGTTCGG GGGCGGCTCC
TACTCGATCG CCAACCTGGA CATCTCGAAG CTGGTGGGCA CGAACGGCGA CGCGGACCTG
AAGAACATCA TCTCGGCGGT CGACAAGGCG ATCGCGGCGG TGACGGATGC GGGCACGAAG
CTCGGCGCGA GCAAGACGCA GATCGAGGGG CAATCCTCGT TCGTGGACAC GCTGATCAAG
GCCAACCAGG CGACGATCGG GACGCTGGTG GACGCGGACA TCGAGGAGGA ATCGACGCGG
CTGAAGGCGC TGCAGACCCA GCAGCAGCTG GCGGTGCAGT CGCTGAGCAT CGCCAACGGG
GCGAGCCAGA ACCTGATGAC GCTGTTCCGC TGA
 
Protein sequence
MTSLLTNSAA MTALTTLKSI NAQLDMTSNR VSTGQRVSAA SDNAAYWSIA TTVRTDNASL 
GAVKDSLGLS SSAVGTAYNG LNSILSDLQN LRAKLQTATN SGTDRSKVQT EIATLQSKMK
ATADSSVSSG QNWLSVNSAA TNTSYKAVQN VVAGFSRASD GTINFSMINI NVGQIKLYDT
NSTSVTAAAT NAQVVGSTSL TGTSGFGSGA GTADFSAINE VSLAISVGGQ TGNIVLNSST
MATAAKDLSK VTTDEFLKAI NNQISASATL AGKVTAGLDS SGRLTFNSTA TGAATTLSVS
VGNPSAGKAL IDVGFGTTTG VATVAMVQGS AFTNLDLSTG SKAITISDGT TSKTITLDAA
AYAALATKTS AGNNAVNATD VAAMINTTLS ATTAAKVAAS VQGGKLVLTS TDTAGAGSKI
IISNADAAFG FTAGTTTGTD VPAGGVVPRT GTGTDAGVTQ AQGILDTAMG TYNAQFGGGS
YSIANLDISK LVGTNGDADL KNIISAVDKA IAAVTDAGTK LGASKTQIEG QSSFVDTLIK
ANQATIGTLV DADIEEESTR LKALQTQQQL AVQSLSIANG ASQNLMTLFR