Gene Mext_0349 details


Gene Information

Locus tag: Mext_0349
Symbol:
ID: 5832109
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 397030
End bp: 398124
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 66%
IMG OID: 641366135
Product: flagellin domain-containing protein
Protein accession: YP_001637844
Protein GI: 163849801
COG category: [N] Cell motility
COG ID: [COG1344] Flagellin and related hook-associated proteins
TIGRFAM ID:
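
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function name `expected_lengths` is hypothetical and not part of any database tool.

```python
def expected_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that does not contribute a residue to the protein.
    """
    gene_length = end_bp - start_bp + 1   # inclusive span on the replicon
    codons = gene_length // 3             # complete codons, stop included
    protein_length = codons - 1           # drop the stop codon
    return gene_length, protein_length


# Values taken from the record above.
print(expected_lengths(397030, 398124))  # (1095, 364)
```

Under these assumptions the result agrees with the listed gene length of 1095 bp and protein length of 364 aa.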


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTGCG TCTTCACGAA CCCCGCGGCT GTCGTCGCGC TGCAGACCCT GCGCGCGGTC 
GTCAGGGACC GCGACGCGAC CTTGCGACGG CTTTCGACTG GATTACGGAT CGGCTCGGCT
GCGGACGAGG CGGCCTACTG GGCCATCGCC TCGACCCTGC GGGCGGACAA CGGCTCGCTT
GCCACGACGC GAGATGCCAT CAGCCACGAC CGCAACACTG TCGAGGCCAT GGCCCGGCGC
TTGGATCGGG TGATCGACCA ATTGGGCGCT ATCGGTCGCA CGCTGGTCTC GGCCTCCGGC
GCGCAGGCCG ACACGACCAA GCTGCAGGTC GACCTGCGCA TCGCTCTCGA TGCCATCCGG
CTCACGGCGG ACAACGCCAT CATGAACGGC GCGAACTGGC TCTCGGTTAA TTCGGAGGAG
CCCAATTTCT CGGTGACCCG GAACCTCGTC ACGGCCTTTT CTTGCCAGGG CGGCAGCGTT
GCGGTCGGAA CCTCGGCCTT CGACACCTCG GGCATCATCC TGTTCGACGC CCGGGCTCGG
GAGGACGGCA GAGGCGCTTT CAGCCGCACG CCGGCCGTCG GCTGCATCCC GACCCTGGCC
CGGGGCATAG CTCGGACCGC ATCGGTGGCA TCCCCCGATG GCTACAGACT CCAGACTTGG
GACGGGCCTA GCCAGCGTGG CGGCCAAACC CTGTACCTGA CTTGGAACCA CGGCCTGCTC
GACACGCAGT TCTACGTCCG AGACGGCAAC GCCGAGCAGC AGCCCTTCTC CATCGCCTCG
ATGGACCTGA CGTCGCCTTA TGCGGATGCC AAGATGATCC AGGCATACGC CAAGGTGGTG
GACGCTACCC TGCAGGTACT GCTCGATGGG GCCGCGAAGC TCGGCGCGAC TTCCGCCCTG
CTCTCGTTGC AGCAGAATTT CGCGGGTAGG TTGATGGACA TCAATGCTTC CGCAATCGGC
GCGCTGGTCG ACGCCGACAT CGAGGAGGCC TCGGCGCGGC TGAAGGCGCT CCGAGTGCAG
CAGCAACTCG GGCTGCAATC GCTGAACATC GCCAATGGCG CCTCCCAGGC CATCCTCGTC
CTGTTCCGGC AGTAG
 
Protein sequence
MTCVFTNPAA VVALQTLRAV VRDRDATLRR LSTGLRIGSA ADEAAYWAIA STLRADNGSL 
ATTRDAISHD RNTVEAMARR LDRVIDQLGA IGRTLVSASG AQADTTKLQV DLRIALDAIR
LTADNAIMNG ANWLSVNSEE PNFSVTRNLV TAFSCQGGSV AVGTSAFDTS GIILFDARAR
EDGRGAFSRT PAVGCIPTLA RGIARTASVA SPDGYRLQTW DGPSQRGGQT LYLTWNHGLL
DTQFYVRDGN AEQQPFSIAS MDLTSPYADA KMIQAYAKVV DATLQVLLDG AAKLGATSAL
LSLQQNFAGR LMDINASAIG ALVDADIEEA SARLKALRVQ QQLGLQSLNI ANGASQAILV
LFRQ
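
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch only, not the pipeline that produced this record. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs in its permitted start codons; this gene starts with ATG, so no special handling is included). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGACCTGCG TCTTCACGAA CCCCGCGGCT"))  # MTCVFTNPAA
```

Passing the full gene sequence to `translate` and `gc_content` would allow the listed protein sequence and GC content to be checked against the nucleotide data.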