Gene Mfla_1821 details

Gene Information

Locus tag: Mfla_1821
Symbol:
ID: 3999978
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacillus flagellatus KT
Kingdom: Bacteria
Replicon accession: NC_007947
Strand:
Start bp: 1961871
End bp: 1965041
Gene Length: 3171 bp
Protein Length: 1056 aa
Translation table: 11
GC content: 61%
IMG OID: 637938737
Product: hypothetical protein
Protein accession: YP_545929
Protein GI: 91776173
COG category: [S] Function unknown
COG ID: [COG2911] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
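
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(1961871, 1965041)
print(length)                  # 3171
print(protein_length(length))  # 1056
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of this record.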


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAAAC GGCTACTGCT CATCCTGCTG CTGTGGTTCC TGCCCTGCAT CGCGCCGGCA 
TATGCCTTGA TCCAGCTCAG CTCCAGCCTG CAGGACCTGC ATTACGATGC GGGCAGCATG
GAGATCGACC TACTGCAGTT CGAGGGCCAC CTCAGGCTAC TGCCCACACG CGAAGGACGC
CTGCGCGTGG ATCGGCTGCA TGCGCAGCGG CTGGTCATCA CCATGAAGCA GGCTGCGCCT
GAAGATGCGC CCGATACCCC CACCGGCAGC CTGCCCGAAT TGCCCGACGC CATCCAACTG
CCATTCCCGA TCGAACTGGA ACAGGCGTCC ATAGATGAAA TCGAGATCGT GTCCGGCAAC
AGCCGCCAGG TGCTGCAGCA TGTCACGCTC AACATGGCGG CGGACAGCAA CACCATCCGC
CTGCAGCTGT CGCGCGCGGA AACGCCCTGG GGCGACATCA CGGCCGAGGT CGTCCTGCGG
AACAGCAAGC CATTTCCCTT GTCGGGCAAC ATCCACATCC GCCAGGCGCA AGGCAACATG
CCGTATGACC TCAACACCAC CCTTTCCGGT GACCTCCAGG CATTGCGCTT TGACATGCGC
AACCTGTTTG CAATGCAGAA CGAGCTGCCT GCCATGGTCG CGATCGATAG TCCGCTGCCT
GCGGCTGGCA GGCTGGATAT TACGGGAGAA GTTGCCCTGG AGGGCGACCT GCCGCTCAAG
CTCGACATCC GGTTACGCGA CTTGAACCCG AACGCGTTGG GCCAGGCGAC CGACGCTCAC
CTGAGCGCCG ACCTCAGCGT CAGCGGCAAG CTGCTGCCCG CTGCCGACTT CACCGTGGCA
TTTAGCAGCC TGGACAGCCA ATGGCGCGGC CATCCTCTGC GAGCCACAGC CAGCATACGC
CTGCTGGATA ACATCCTGGA CAGCATCCAG GTCGATGCCA CACTCGGCGA CAACCGCATC
AGCGCTCGAG GTAGCCTTGG CCAGCCCGGC GGCACCCTCG CCTGGCAAGC GGCCTTTCCC
GTATTGGCTG CACTAGGGCC GGCGTTTGCT GGCAAGGCCG AAGCCAGCGG CACAGTAGCT
GGCGAGTTCG ACGATCTGCG CGCGCAATTC CAGCTGCTCG CGGAAAACCT GCGCCTTCCA
GGCAATATCA CGGCCCATCA GCTCAGCGGC AACGGCAGCC TGCACACCCA GGGCGAACTC
AATGCAGCGC TCAATGCTTC AGGCCTGCGC ATCCACCAAG GCAGCTTCAT GGATGGCAAA
ATCGCTCTGA CCGGCAACCG CGCGCGCCAT ACGCTCAGAA TGGAAGCCAC CGGTCCTGGC
CTCAACCTGC AATCCACCCT TGATGGCGGC ATCGATGACA CCGGTGCCTG GACAGGCGTC
CTGCACCAGT TCGAATACCA ATCACAGGCA CCGGTCAAAC TGCAAGCGCC CGCCCCCATC
CGCTACGATG CGGCCGGGCT TGCAATCGAC GACCTGACCT TGCAGTTCAA GCAGGGCATC
ATCCGCCTGG ACACCTTGCG CCAAGGTCCC AATGGCTTGC AGACGCAAGG CCAGATTGAG
CGGTTGGCCT TGCAGGACAT TCCGCCGCTG TTATTCTCAT TGCCTGCCAA TCTCAAGGGC
AATCCGGTAT TTTCAGGGGA ATGGGATATC AATGCCGGGG AACTGCTCAA CGGTAAAGTC
ATGCTGCAGC GCGAGTCCGG CGATATCGCC GTGGTACGCG AAGGCCAGCC CGAACAACCG
CTAGGCCTCA GCGAAGTTAA ATTACTGCTG GCGATGCGTG ACAATCAGGT CGGCCTGACG
GCCAGCATTC GGGGTAGCGG GCTCGGCAAT ATTGCCGGAA GCCTCAATAC CAGTGTCACC
AGCGCAGATG GCAGCCTCAG CCTCGTCAGC AGTGCACCGT TGCAAGCCAG CCTGATCGCG
GAAGTGAACA GCCTTGCCTG GCTGCCCTCT CCCGACATCC AGGCGGATGG TACCCTCCAT
ATCGATATCC ATGCCGATGG CAGCATCGGC AACCCCAACC TGGACGGCAA TATCCGCGGC
CGCAACCTCA GCGCCAGCCT GCCGGCCGAA GGCGTGAACC TCACCAATGG CCAGCTCGAC
GCATCTCTGT CAGGCAACAG GCTGATTCTG GATACGCTGC GCTTCACAGG AGGCCAGGGT
ACCATCAATG CCAGTGGCAA CATCAGCCTG GTAGCAGGCA AACCGGCAAT GGAGCTCGAC
TGGATCCTCG ACGACTTCAC GGCGGCCGAA CGCACTGACC GCACATTGGT ATTGAAAGGC
AGCGCAAAAA CCACACTACA GAACAATGAA CTCATCCTGG ACGGCGACCT GCGCATCGTG
CGTGGCTTGA TCGAACTGGC GTCCGAAGGT GCCCCGCAGC TAGACAACGA CGTCGTCGTG
GTAGGACGTG AGCGCGCGGA AGAGCCGGCC CCATTGCAAT TCACGATTGG CCAGCTCAGG
ATCAACCTCG GTGACGAAGT CATTGGCATC GTCGATCCGG GCAAGCAGTT CCTGCTGCGC
GGTTTCGGCC TGGATGGCTA CCTCACCGGC ATTCTCACCC TGTCAGGAAG CGTACCCAAC
GGCCTGCGTG CGGAAGGTTC GATCAGGGTT GGCGGCACCT ATATGGCCTA TGGTCAGCTG
CTCAACATCA AGCAGGGCAT CGTCAACTTC AGCGGCCCGG TGGATAATCC CGGCCTCAAC
ATTACCGCCA TGCGCGAGAA CCAGACCGTA CAGGCCGGCG TCGAAATCAC CGGCAATGCC
CAGATGCCCA TGGTCAAGCT GGTTTCCAAC CCCAGTGTGC CGGACAGCGA AAAACTCTCC
TGGCTGGTGC TCGGCCATGG CCTGGACCAG GCAGGCAAGA ACGAATTCGC CATGCTCTCG
CTCGCCGCCG GCGTCCTGCT CTCTCAGGGA GACTCCGTTC CGCTACAGAC CAGGATGGCC
CGCGCCGCAG GCCTCGACAG CTTCAGCATA GGCGGCAGTG ATATGGAAAG CACTACCGTC
AACTTCGGCA AGCGCCTGTC TTCCAGGCTC TACCTCAGCT ACGAAAAGAG CCTGACCGGC
CTGCTCAACG TCGCCAAGCT TACCTACACC ATCAGTCGCA ACTGGTCGGT CGTCTCGCAA
GCTGGGTCGG AAAGCGCGGT GGATGTGCTT TATACTTTCC GCTTTCAATA A
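
The gene sequence is printed in blocks of ten bases, so the grouping whitespace has to be removed before any calculation. The following minimal sketch shows one way to do that and to compute a GC percentage comparable to the GC content field; the helper names are hypothetical, and only the first sequence line is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group the sequence into blocks."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


first_line = "ATGGCAAAAC GGCTACTGCT CATCCTGCTG CTGTGGTTCC TGCCCTGCAT CGCGCCGGCA"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_content(seq), 1))  # GC percentage of this line only
```

Passing the full gene sequence to `gc_content` gives a figure for the whole gene rather than for a single line.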
 
Protein sequence
MAKRLLLILL LWFLPCIAPA YALIQLSSSL QDLHYDAGSM EIDLLQFEGH LRLLPTREGR 
LRVDRLHAQR LVITMKQAAP EDAPDTPTGS LPELPDAIQL PFPIELEQAS IDEIEIVSGN
SRQVLQHVTL NMAADSNTIR LQLSRAETPW GDITAEVVLR NSKPFPLSGN IHIRQAQGNM
PYDLNTTLSG DLQALRFDMR NLFAMQNELP AMVAIDSPLP AAGRLDITGE VALEGDLPLK
LDIRLRDLNP NALGQATDAH LSADLSVSGK LLPAADFTVA FSSLDSQWRG HPLRATASIR
LLDNILDSIQ VDATLGDNRI SARGSLGQPG GTLAWQAAFP VLAALGPAFA GKAEASGTVA
GEFDDLRAQF QLLAENLRLP GNITAHQLSG NGSLHTQGEL NAALNASGLR IHQGSFMDGK
IALTGNRARH TLRMEATGPG LNLQSTLDGG IDDTGAWTGV LHQFEYQSQA PVKLQAPAPI
RYDAAGLAID DLTLQFKQGI IRLDTLRQGP NGLQTQGQIE RLALQDIPPL LFSLPANLKG
NPVFSGEWDI NAGELLNGKV MLQRESGDIA VVREGQPEQP LGLSEVKLLL AMRDNQVGLT
ASIRGSGLGN IAGSLNTSVT SADGSLSLVS SAPLQASLIA EVNSLAWLPS PDIQADGTLH
IDIHADGSIG NPNLDGNIRG RNLSASLPAE GVNLTNGQLD ASLSGNRLIL DTLRFTGGQG
TINASGNISL VAGKPAMELD WILDDFTAAE RTDRTLVLKG SAKTTLQNNE LILDGDLRIV
RGLIELASEG APQLDNDVVV VGRERAEEPA PLQFTIGQLR INLGDEVIGI VDPGKQFLLR
GFGLDGYLTG ILTLSGSVPN GLRAEGSIRV GGTYMAYGQL LNIKQGIVNF SGPVDNPGLN
ITAMRENQTV QAGVEITGNA QMPMVKLVSN PSVPDSEKLS WLVLGHGLDQ AGKNEFAMLS
LAAGVLLSQG DSVPLQTRMA RAAGLDSFSI GGSDMESTTV NFGKRLSSRL YLSYEKSLTG
LLNVAKLTYT ISRNWSVVSQ AGSESAVDVL YTFRFQ
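
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the usual TCAG-ordered amino acid string; translation table 11 uses the same codon assignments as the standard code and differs in its set of permitted start codons, which this sketch does not handle (the gene here starts with ATG, so that does not matter). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 51 bases of the gene sequence in this record.
print(translate("ATGGCAAAAC GGCTACTGCT CATCCTGCTG"))  # MAKRLLLILL
print(translate("GCTGGGTCGG AAAGCGCGGT GGATGTGCTT TATACTTTCC GCTTTCAATA A"))  # AGSESAVDVLYTFRFQ
```

Both fragments match the corresponding ends of the listed protein sequence, and the final TAA codon is the stop codon that is counted in the gene length but not in the protein length.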