Gene Mpe_A2867 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2867 
SymbolfliC 
ID4785561 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3056659 
End bp3057873 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content66% 
IMG OID640091438 
Productflagellin-related hook-associated protein 
Protein accessionYP_001022056 
Protein GI124268052 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATGT CAGTCAACAC CAACATCGTT TCGCTCAATG CGCAACGCAA CCTCGGCACC 
TCTCAGTCGT CGCTGGCAAC GTCCATGCAG CGCCTGTCTT CCGGCCTGCG TGTCAACAGC
GCGAAGGACG ACGCCGCCGG TCTGGCGATC GCCGAGCGCA TGAACGCGTC GGTCCGTGGC
CTCAATGTCG CGGCGCGCAA CGCCAACGAC GGCATCTCGC TGGCGCAGAC CGCCGAAGGC
GCGCTGGGCA AGGTCGGCGA CATGCTGCAA CGCATGCGTG AACTGGCCGT CCAGTCGGGC
AACGCCACCA ACAGCGCCGA CGACCGCAAG GCCCTGCAGG CCGAAGTCAC GCAACTGCGC
GACGAAATCG ACCGTGTGGC GAAGCAGACG ACCTTCAACG GCCGCAAGCT GCTCGACGGC
TCCTTCACCG CGGCGGCCTT CCAGGTCGGC GCCGGCGCCG GCGACAACAT CACGGTCGGC
AGCCTGACGA ACGCATCGGC CAGCAACCTG TCGAAGATCA CCTACGCCGA AATCTCCAGC
GGTGACCTGG CGAAGGACGA CACCGACATC ACGACGCTGG ACGCGATCGC CGACGGCGAC
CTGCAGATCA CGATCGACGA CGGCGGCGAC AACGAACTGG TGGTCGAGGT GGGTGCGATC
GCCCAAGCGA GCTCGGGCTT GGAGCGTCTG GGTCAGGTGG CCGAGGCGAT CAACCGCAAG
ACCAGCGACA CCGGCGTGTC GGCCTACCTG GTGGCCAATG ACGACGGCAC CTACAAGCTC
GACATCAAGG CCTCGCGCCT GGATGCCGAC GGTGCCCCGC TGTCGGTGGA GTTCACCGGC
TTCGATACCA CGACCACGGG TCTGGACGAA GGCGACGTGC CCGCCGCGGT GACGGATGCC
ATTGGCATCG ACGCGCTGAG CATCGAGACC GAATCGGATG TGTGGGTGTC GATCAAGAAG
ATCGACAGCG CACTGGACCA GGTGAACAGT GCCCGCGGTA CCCTGGGCGC GATCCAGAGC
CGCTTCGAGA ATGCGGTGTC GAACATCCAG ATCCAGGCGG AGAACACCGC GGCCTCGCGT
GGCCGGATCA TGGATGCCGA CTTCGCGTCG GAAACGGCCA ACCTGTCGCG CTCGCAGATC
CTGCAGCAGG CCGGTACCGC CATGGTGGCC CAGGCCAACC AGCTGCCGCA GCAAGTGCTG
TCGCTGCTGC GCTGA
 
Protein sequence
MAMSVNTNIV SLNAQRNLGT SQSSLATSMQ RLSSGLRVNS AKDDAAGLAI AERMNASVRG 
LNVAARNAND GISLAQTAEG ALGKVGDMLQ RMRELAVQSG NATNSADDRK ALQAEVTQLR
DEIDRVAKQT TFNGRKLLDG SFTAAAFQVG AGAGDNITVG SLTNASASNL SKITYAEISS
GDLAKDDTDI TTLDAIADGD LQITIDDGGD NELVVEVGAI AQASSGLERL GQVAEAINRK
TSDTGVSAYL VANDDGTYKL DIKASRLDAD GAPLSVEFTG FDTTTTGLDE GDVPAAVTDA
IGIDALSIET ESDVWVSIKK IDSALDQVNS ARGTLGAIQS RFENAVSNIQ IQAENTAASR
GRIMDADFAS ETANLSRSQI LQQAGTAMVA QANQLPQQVL SLLR