Gene Mfla_1041 details

Gene Information

Locus tag: Mfla_1041
Symbol:
ID: 4000107
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacillus flagellatus KT
Kingdom: Bacteria
Replicon accession: NC_007947
Strand:
Start bp: 1083531
End bp: 1085030
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 49%
IMG OID: 637937941
Product: hypothetical protein
Protein accession: YP_545150
Protein GI: 91775394
COG category: [R] General function prediction only
COG ID: [COG3497] Phage tail sheath protein FI
TIGRFAM ID:
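
The coordinates and lengths listed above are mutually consistent if Start bp and End bp are read as 1-based inclusive positions and the 1500 bp span includes the terminal stop codon. Both readings are assumptions rather than something the record states. A minimal Python sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int, includes_stop: bool = True) -> int:
    """Residue count of an unspliced CDS, assuming one codon per residue."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    # Assumption: the stop codon is counted in the CDS length but not in the protein.
    return codons - 1 if includes_stop else codons


length = gene_length_bp(1083531, 1085030)
print(length, protein_length_aa(length))
```

With the values from this record the sketch gives 1500 bp and 499 aa, matching the Gene Length and Protein Length fields.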


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGAACT ATCAACATCC TGGCATTTAT GTACAGGAAA TAGCTATTGC ACCCAACATT 
GAGCCTGTTG CCGCATGCCT GCCCCTATTC ATTGGCCATA CAGAAAAATC CCAGGATGCC
AATGGACATC CATTACCATT GGGTAAGCGC TACAGCGTTT CCTCACTTGC AGAATACGAA
GAACACTTTG GCAAAGGTGC GCCAGAAACC CTACGCGTGC TATTGGACGG CAAAGGTAAT
ATCGTCAATG CCCACAGCTA CAGTCCTTTT TACCTTTATG CAGCCGTACA TCAATACTTC
GCCAATGGAG GTGGACAATG CGAAATTCTT TCCGTTGGCC CATATTCCCG GGCACCAGAT
GCTTCTGCGC TGAAAGCGGC CATTAGAGCA TTACCCCTAA ACGCCGCATT TACACTGGTT
GCGATCCCCG ATGCTGTTTC GCTAGCAGAA TTACCTCAAC TCCAAAGGAA ACTACTGCAA
TATTGTGCAC AACAGGATTA TTGCCTAGCA ATTCTGGATG TCCCCTATTG CAATGACCAG
ACTCGGGAGA TGACGGTCAA TACGTTCCGC CATGATATCG GTCAGCGAGG CCTGCAACAC
GGTGCAGCGT TCATGCCATG GCTGGAAACA GCTGCGGCTG GCATCTCAGG CTATGTCGAC
CTGGAAATCA AATATACGCC TGGCGTTGCC ACAGCCTGGA ATGATTTTCA CAAAGCCACC
AAGCTCCAAT CTCACCTGCT TCATAAACAA CTGGAAGCAT CCCCTGCGAT ACGATTCAAC
AGGATGCTTC CTCCCAGCGG CAGCATCATG GCATTATTTG AAGCCAATGC CCGCAAGCGT
AACATTTGGA CAACCCCTAA CCATACCCCC ATCAAGGAAA TCATTTCATT AAGCGCCACC
ATTGATAACA TCATTCAAGA ATCGCTGAAT ATTCACCCAA CAGGTAAATC AATCAATGCC
ATTCGTCGCT TCGACAATGC TCTCTTGATG TGGGGAGGAC ACACATTAGC CAGCAATGAC
AGCGAATGGC GTTATATAGC CCATTTGCTT ACCCGTGGAT TTGTGCAGGC CTCTCTACGT
CGATTCCTGG ATCAGCAAAC CTTTGAGAAA AATGATGAAG CTACATGGAG TTTGGTTGGT
CACCAGTGCC AGGATTTTCT GCACACGCTT TGGCGGGAGG GTGCTTTAGT TGGAGATAAA
CCTGAACAAG CATTCTATGT CAGGATAGGC CTTAACCAGA CCATGTCCAC CCAGGATATC
GCAGCGGGAC GAATAATTGT GCATGTAGGC ATTGCCTTGC TCAGGCCTGC CGAATTTATC
ATACTAAAGC TACATAAAGT CATATCAGCC CCCGACTTGA AACCAGCAAT CACCAGGCCT
GGAAAACCGG TAAATCGAGT CAAGATCCGG GCTAAGACAA GCCCGATCAA TGTTGTCATC
CGTCAGCGAC AACAACCCAG TCCTCCTAGC AAACCTTTGC CATCGGAGCC GGTTTCGTGA
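
The gene sequence is displayed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. As an illustration of how a figure such as the GC content field could be derived, here is a minimal Python sketch. The function names are hypothetical, and the method the database used to arrive at its 49% value is not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Drop spaces and line breaks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, used here only as example input.
first_line = clean_sequence(
    "ATGGGGAACT ATCAACATCC TGGCATTTAT GTACAGGAAA TAGCTATTGC ACCCAACATT"
)
print(len(first_line), round(gc_fraction(first_line), 2))
```

Applying `gc_fraction` to the full cleaned sequence, rather than to one line, would be the way to compare against the GC content field.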
 
Protein sequence
MGNYQHPGIY VQEIAIAPNI EPVAACLPLF IGHTEKSQDA NGHPLPLGKR YSVSSLAEYE 
EHFGKGAPET LRVLLDGKGN IVNAHSYSPF YLYAAVHQYF ANGGGQCEIL SVGPYSRAPD
ASALKAAIRA LPLNAAFTLV AIPDAVSLAE LPQLQRKLLQ YCAQQDYCLA ILDVPYCNDQ
TREMTVNTFR HDIGQRGLQH GAAFMPWLET AAAGISGYVD LEIKYTPGVA TAWNDFHKAT
KLQSHLLHKQ LEASPAIRFN RMLPPSGSIM ALFEANARKR NIWTTPNHTP IKEIISLSAT
IDNIIQESLN IHPTGKSINA IRRFDNALLM WGGHTLASND SEWRYIAHLL TRGFVQASLR
RFLDQQTFEK NDEATWSLVG HQCQDFLHTL WREGALVGDK PEQAFYVRIG LNQTMSTQDI
AAGRIIVHVG IALLRPAEFI ILKLHKVISA PDLKPAITRP GKPVNRVKIR AKTSPINVVI
RQRQQPSPPS KPLPSEPVS
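
The protein sequence above can be cross-checked against the gene sequence by translating it codon by codon. The sketch below relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in their permitted start codons), and it builds the codon map from the usual compact 64-letter amino acid string. The function name `translate` and the choice to mark unknown codons as `X` are assumptions made for illustration.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # remove the ten-base block spacing
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last displayed lines of the gene sequence.
print(translate("ATGGGGAACT ATCAACATCC TGGCATTTAT GTACAGGAAA TAGCTATTGC ACCCAACATT"))
print(translate("CGTCAGCGAC AACAACCCAG TCCTCCTAGC AAACCTTTGC CATCGGAGCC GGTTTCGTGA"))
```

The two calls yield MGNYQHPGIYVQEIAIAPNI and RQRQQPSPPSKPLPSEPVS, which agree with the first twenty and last nineteen residues of the listed protein. Each 60-base display line happens to hold exactly 20 codons, so single lines can be translated in frame.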