Gene Mpe_A3063 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3063 
SymbolflgL 
ID4784925 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3258401 
End bp3259597 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content71% 
IMG OID640091634 
Productflagellin-related hook-associated protein 
Protein accessionYP_001022251 
Protein GI124268247 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID[TIGR02550] flagellar hook-associated protein 3 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.154617 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.835037 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGTGA GCACTTCCAG CACCTACGAC GCCACCATCG CCAACCTGCA GCGCCGGGCC 
CAGGAGCTGA GCCGCAGCCA GGAGCAGATG AGCAGCCTGA AGCGCGTCAA CCGCGCCAGC
GACGACCCGA CGGCCGCCGC CCGGGCCGAA CGGGCGCTCG CCGTCGAGGC GCGCAGCGCC
GCGACGCAGC GGGCCATCGA CGCCAGCATG AACGCGGTGA GCCAGACCGA ATCGACCCTC
GGCGACGCGC TGGGCCTGCT CCAGCAGGCG CGCGACTCGG TGGTCTCGGC CGGCAACGCC
AGCTTCAGCG ACGCCGAGCG CGCCAACCTC GCGATCAGCC TGCGCCAGAT GCGCGCCGAA
CTGCTGACCA TCGCCAACAG CCCCGACGGC GCCGGCAGCT ATCTGTTCGC CGGGCAGGGC
GCCAGCAACC CGCCGTTCGT CGACACGCCG ACCGGCGTGC AGTTCCGCGG CGTGTCGGGC
GAGACCGGCG CGCCCTCCAC CGAGCCGCTG CCGCTGACGA TGGACGGCCG CAGCACCTGG
CTGCAGGCCT CCAGCGGCAA CGGCGTGTTC GTGACCAGTG CCACCCAGCA GGCGGGCACC
GGGTGGATCA ACACCGGCAG CGTGATCGAT CCGTCGGCTG CGACCGGGGA TCCCTACAGC
CTGAGCTTCA CCGTGGCCGG CGGCGTGACG ACCTACACCG TGCTGCGTGA CGGCAACCCG
ACCGCACTGA CCGACGTGCC CTACCAGAGC GGGCAGGCGA TCGAGATCGA CGGCCTGTCG
TTCACCGTGA CCGGCGCGCC GGCCGACGCC GACCGCTTCG ACATCACCCC GTCCTCGGCG
TCGCTCAGCA TCTTCGATGT GCTCGACCGC GTCGCCACCG AGCTGGAGAC GCCCGGGCGC
AACAACGGCC AGATCGCGCA GTCGGTGGCC ACCGCGCTGC GCGACATCGA CGCGTCCAGC
GCCAAGCTGA CGGCGCAGCG CGCCGTGGCG GGCGAGACGC TGAACCGCAT CGACCGCGCC
GCCGAGCGCC TGAGCGGGCA GAAGCTCGCC GCGCAGACCG ACCGCTCGCA GGCCGAGGAT
CTCGACATGG TCGAGGCGAT TTCCACGTTC CAGAAACAAC AGACCGGCTA CGAGGCAGCG
CTGAAGTCCT ATGCTTCGGT ACAGAACCTG TCGCTGTTCC AATACCTCAA TGTGTGA
 
Protein sequence
MRVSTSSTYD ATIANLQRRA QELSRSQEQM SSLKRVNRAS DDPTAAARAE RALAVEARSA 
ATQRAIDASM NAVSQTESTL GDALGLLQQA RDSVVSAGNA SFSDAERANL AISLRQMRAE
LLTIANSPDG AGSYLFAGQG ASNPPFVDTP TGVQFRGVSG ETGAPSTEPL PLTMDGRSTW
LQASSGNGVF VTSATQQAGT GWINTGSVID PSAATGDPYS LSFTVAGGVT TYTVLRDGNP
TALTDVPYQS GQAIEIDGLS FTVTGAPADA DRFDITPSSA SLSIFDVLDR VATELETPGR
NNGQIAQSVA TALRDIDASS AKLTAQRAVA GETLNRIDRA AERLSGQKLA AQTDRSQAED
LDMVEAISTF QKQQTGYEAA LKSYASVQNL SLFQYLNV