Gene Mfla_1842 details

Gene Information

Locus tag: Mfla_1842
Symbol:
ID: 4000969
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacillus flagellatus KT
Kingdom: Bacteria
Replicon accession: NC_007947
Strand:
Start bp: 1980615
End bp: 1981685
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 57%
IMG OID: 637938758
Product: rhomboid-like protein
Protein accession: YP_545950
Protein GI: 91776194
COG category: [R] General function prediction only
COG ID: [COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid)
TIGRFAM ID:
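
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal Python sketch of those relationships, assuming the start and end coordinates are 1-based and inclusive and that the coding sequence ends in a single untranslated stop codon. The helper names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp, end_bp):
    """Length in bp for inclusive 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Amino acid count, assuming one trailing stop codon that is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_percent(dna):
    """Percentage of G and C bases; whitespace in the sequence is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


length = gene_length(1980615, 1981685)
print(length, protein_length(length))  # 1071 356
```

With the coordinates listed in this record, the sketch reproduces the reported gene length of 1071 bp and protein length of 356 aa. The gc_percent helper can be applied to the gene sequence for comparison with the reported GC content.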


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000517898
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGACC TAACCGATAG CTGGCCTTCC CTAGGTGCTG CAATCAAGCA GCGCACGCCA 
GGCATCCCGG TGACCAAGTG CCTCATCGCG GCCAACTTGC TGGTATTTGT GCTGATGCTG
TTCAATGGTG CTGGTTTCTG GCACTCCCCC AACAATGTGC AGCTGCAGCT TGCATGGGGT
GCCAACTTCG GTCCTGCGAC GCAGGATGGG GAATGGTGGC GTTTGTTCAC CGCCTTGTTC
TTGCATTTCG GCGCTGTGCA CCTGGCATTG AACATGATTG CATTCTGGGA TGGCGGGCAG
CTCGTGGAGC GCATGTATGG CCATTGGCGC TACCTGGTCA TCTATCTGGT CAGCGGCTTG
GTCGGTAACC TGCTTTCGCT GGTGTGGCAG GGCAACCAGG CGGTATCTGG TGGGGCATCT
GGCGCGATTT TCGGCATTTA TGGCGCCTTG ATCGTATTTC TCTGGCAAGA ACGGGCGTTG
CTAGACCGAC GCGAGTTCCG TTGGCTGTTC GGGGGCGCCT GCGTATTCGC AACCGCCACC
ATTGCGCTGG GCTTCATGAT TCCCGCGATT GACAATGCCG CACATATTGG CGGCTTTGTC
GCGGGCATGC TGGCTGGATT GCTGCTGATG CGCGGCTTGA GGCCGCAAGA GGTAGTGCCA
CGTTTGCCCC GGCTGATCGG CGGCAGCCTG CTGGTAGCTG CTATTGCCAT CATGCTATAC
AAGCTTCCAG CGCCTAAATA CAGCTGGGGC GATGAGCTGC TGCTGCAAAA GGAAATCAAT
GCTTTCATTC AGGAAGATCA GGCGATCAAC CGTTCTTGGC TGCATATCAT GCATGAGAGC
AAGCAGGGCA ATGTGACATA TTTTGAGCTG GGCGAGCAGA TCGAGAACGA TATCACCGAC
CGCTATCAGG AACGCTATGA GGCCTTGTCG CAACTGCCCT ATGATCCTAA TCTACCGTCT
GCCGCCAAGC TAGAGAACAT ACTGCAATAC ACCAAGCAGA AGCGCGATGC TTCGCGGGCT
TTGGCCGAGG AGCTCAAGCA GGGCAGAAAG CCGTCTAAAC CTGCGCCCTG A
 
Protein sequence
MTDLTDSWPS LGAAIKQRTP GIPVTKCLIA ANLLVFVLML FNGAGFWHSP NNVQLQLAWG 
ANFGPATQDG EWWRLFTALF LHFGAVHLAL NMIAFWDGGQ LVERMYGHWR YLVIYLVSGL
VGNLLSLVWQ GNQAVSGGAS GAIFGIYGAL IVFLWQERAL LDRREFRWLF GGACVFATAT
IALGFMIPAI DNAAHIGGFV AGMLAGLLLM RGLRPQEVVP RLPRLIGGSL LVAAIAIMLY
KLPAPKYSWG DELLLQKEIN AFIQEDQAIN RSWLHIMHES KQGNVTYFEL GEQIENDITD
RYQERYEALS QLPYDPNLPS AAKLENILQY TKQKRDASRA LAEELKQGRK PSKPAP
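
The protein sequence can be checked against the gene sequence by translating the latter codon by codon. The following is a minimal Python sketch, assuming that the amino acid assignments of translation table 11 match those of the standard genetic code (the two tables differ in their permitted start codons, not in how sense codons are read). The names CODON_TABLE and translate are illustrative and do not come from the source.

```python
from itertools import product

# Codons in the conventional T, C, A, G order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGACTGACC TAACCGATAG CTGGCCTTCC"))  # MTDLTDSWPS
```

Passing the full gene sequence to translate and comparing the result with the protein sequence, after removing the spaces between the 10-residue groups, gives a quick consistency check of the record.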