Gene Mpe_A2866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2866 
SymbolfliC 
ID4785560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3055232 
End bp3056446 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content67% 
IMG OID640091437 
Productflagellin-related hook-associated protein 
Protein accessionYP_001022055 
Protein GI124268051 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCAGA CCATCAACAC GAACGTCATC TCGCTGAATG CCCAGCGGAA CCTGAACACC 
AGTCAGTCCT CGCTGGCCAC CTCGATGCAG CGGCTGTCGT CGGGTCTGCG CGTCAACAGC
GCGAAGGACG ATGCCGCCGG CCTGGCGATC GCCGAGCGCA TGAACACCCA GGTGCGCGGC
CTGAACGTCG CGGCGCGCAA CGCCAACGAC GGCATCTCGC TGGCCCAGAC CGCCGAAGGC
GCGCTGGGCA AGCTGGGCGA CATGCTGCAG CGGATGCGCG AACTGGCGGT GCAATCGGCC
AACGCCACCA ACAGCGCCGA CGACCGCAAG GCGCTGCAGG CCGAAGTGAA CCAGCTGCGC
GACGAAATCG ACCGCGTGGC CAAGCAGACC AGCTTCAACG GCAAGAAGCT GCTCGACGGC
TCCTTCACTG CGGCGACCTT CCAGGTCGGG GCCAACTCCG GCGATGCGAT CACCGTCGGC
AGCCTGACCA ACGCCACCGC GGCGGTGCTC TCGAAGATCA CCTACGCCGA GGGCGCGAGT
GCCGACCTGA CGCTCGACGG CACGACCATC ACGGACCTGG CCGCGATCGC CGACGGCGAC
CTGCAGATCA CGATCGACGG CGGTGGCGCG AACGAGCAGA TCGTCGAGGT CGGCCCGATC
GCCGAGGCCA GCACCGAAAC CGAACGCCTG GGCCAGATCG CCGAGGCGAT CAACCGCAAG
ACCACGGACA CCGGCGTCTC CGCCTACCTG GTGAAGAACG ACGACGGGAC CTTCAAGCTC
GACATCAAGG CATCCAAGCT GGATGCTGCG GGCGCGGCGC TGGCGGTGGA GTTCACCGGC
TTCACCACGG CGACGACGGG CCTGGACGAA GGCGACGCGA TCGCCGAGGT CGCCGACGAC
ATCGGCCTGA GCGACTTGAA CATCGAGACC GATTCCGCCA CCTGGGTGTC GATCAAGAAG
ATCGACAGCG CGCTCGACCA GATCAACGGC GCACGCGGCA CGCTCGGCGC GCTGCAGAGC
CGCTTCGAGA ACGCGGTGTC GAACATCCAG ATCCAGGCCG AGAACCTGTC GGCCGCGCGT
GGCCGCATCA TGGACGCCGA CTTCGCGATG GAAACGGCGA ACCTGTCCCG CGCCCAGATC
CTGCAGCAGG CCGGCACCGC GATGGTGGCG CAGGCCAACC AGCTGCCTCA GCAGGTGCTG
TCGCTGCTCA AGTAA
 
Protein sequence
MAQTINTNVI SLNAQRNLNT SQSSLATSMQ RLSSGLRVNS AKDDAAGLAI AERMNTQVRG 
LNVAARNAND GISLAQTAEG ALGKLGDMLQ RMRELAVQSA NATNSADDRK ALQAEVNQLR
DEIDRVAKQT SFNGKKLLDG SFTAATFQVG ANSGDAITVG SLTNATAAVL SKITYAEGAS
ADLTLDGTTI TDLAAIADGD LQITIDGGGA NEQIVEVGPI AEASTETERL GQIAEAINRK
TTDTGVSAYL VKNDDGTFKL DIKASKLDAA GAALAVEFTG FTTATTGLDE GDAIAEVADD
IGLSDLNIET DSATWVSIKK IDSALDQING ARGTLGALQS RFENAVSNIQ IQAENLSAAR
GRIMDADFAM ETANLSRAQI LQQAGTAMVA QANQLPQQVL SLLK