Gene Mlg_1013 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1013 
Symbol 
ID4270042 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1151893 
End bp1153563 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content67% 
IMG OID638125764 
Productserine phosphatase 
Protein accessionYP_741856 
Protein GI114320173 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGAAG CGGCAGCGCC GGGCCGGCGG GTTTCGCGGG GTCTGACCTT TAAGCAGGCG 
GTCACCACCC TGGTGGTGGT CTTCCTGTTG GGTGTGCTGG CGGCCGCCGT GGAGATCTAC
GCCGACTGGC GCTCCATGCG CGACGAGGTC CGTACCCATA TGCAGCAGAC CCTCGCCATG
GTCGAGGGCT CGGCGGTGGA CGCCGCCTTC AACCTCAACC CGGACGGTGC CAATCAGATC
GCCCGGGGGT TGGCCGAGTA CGAATTCATC CAGCGCGTGG AGTTGCACGA TAACTTTGGC
GACCGGCTGG CGCTGTACGA TCGCCCCCGG GCGCTGAATG AGAATGGGAT GGCGGGCCGG
CTGTTCGACG ACCTTTCCCA CTGGGAGGTC GAGCTGATCC GGCAGGACAT CATTGGCCGT
GCTGAGCCCG TCGGTGAATT GCGGGTGCGA CTGGACCCGG AGGTGATCGC CGCCGGTTTC
GTGCACCGCA CGGCAGTGAA CGCAGGGGTC AACTTGATCA AGGCGCTGGC CATTTCGTTG
CTGGTCGTCG CGATCTTCCA CTTCCTCATC ACCCGTCCGC TGTTGCGGCT GAACACCGCC
ATCGCCGGTG TAGACCCCGC CCACCCGGGC GATTGGTCGA GGCCGGGTAT GCCCGGCCAC
CAGGGCGACG AGCTGGGCCA GATCGTGGGC TCGCTGGACC GTCTGCTGGG GGCCTTTCAG
AAGGGGTTGA ACCAGCGGGA TCAGGCCGAG GGTGAGTTGA AGGCGCTGAC CGAGGAGTTG
GAGCAACGGG TACAGGACCG CACCCGTAAG CTGCAGGATG CCATGGACGA GCTTGCCGCG
GAGAAGGAAG AGACCGAGGC CGCCTATGGG CGCCTCAACG AGGCGCACCG GGAGTTGGAG
CGGGCCAACC GCCTGGTGGT GGAGAGCATC CGCTACGCCC GCCGGATTCA GACGGCGATG
CTCCCCGACA AGTCGGCACT GGGCGATGCC GTCCAGGAGA TCCATGTCTG CTGGGAGCCG
CTCCACCTGG TGGGCGGGGA CTACTTCTGG CTGGAGCGTT TCGGGCGGCA GAGCCTGATT
GTGGTGGTGG ATTGCACCGG GCACGGTGTG CCAGGCGCCT TCATTACCCT GGTGGTGGCC
TCGGCGCTGG ACCGTATCCT CCACGAGCGC GATCTGCGCA GCCCGGCGGA GATCCTCACC
GCCCTGGACG AGATGGTCCG GGCCCGCCTG CGTCAGGATG GGGAGGAGCC GGAGTCCGAC
GACGGTCTGG ATGCCAGCAT CTGCCTCTGG GACGAGGCCG ATCGCAGCGT GACGTTCTCC
GGTGCCGGCC TGCCCTTGAT TTACGTCGAG GATGGCGAGG CGCATGAAAT CAAGGGCAAC
CGGGCCGGCC TCGGCTACCA TAGCCTGGTC CCGCGTAAAC CGTTCGTGGA TCACCGGGTA
CCGGTGAAAC CGGGGATGTC CTTCTACCAA CTCACCGATG GCATCCCCGA CCACATGGGT
GGGGAGCCCC GGCGACTGCT CGGCCGCCGG CGGGTGCGCC GGCTGATCGC CCGTAACGCC
CATCTGCCCA TGGCCGAGCA GATCCAACGC CTGGAGGCGG AGCTGGAACG CTACCGTGGC
CCCGAACCCC GCCGTGATGA CATGACCCTG GTGGGCTTCC GCCCCCTCTG A
 
Protein sequence
MQEAAAPGRR VSRGLTFKQA VTTLVVVFLL GVLAAAVEIY ADWRSMRDEV RTHMQQTLAM 
VEGSAVDAAF NLNPDGANQI ARGLAEYEFI QRVELHDNFG DRLALYDRPR ALNENGMAGR
LFDDLSHWEV ELIRQDIIGR AEPVGELRVR LDPEVIAAGF VHRTAVNAGV NLIKALAISL
LVVAIFHFLI TRPLLRLNTA IAGVDPAHPG DWSRPGMPGH QGDELGQIVG SLDRLLGAFQ
KGLNQRDQAE GELKALTEEL EQRVQDRTRK LQDAMDELAA EKEETEAAYG RLNEAHRELE
RANRLVVESI RYARRIQTAM LPDKSALGDA VQEIHVCWEP LHLVGGDYFW LERFGRQSLI
VVVDCTGHGV PGAFITLVVA SALDRILHER DLRSPAEILT ALDEMVRARL RQDGEEPESD
DGLDASICLW DEADRSVTFS GAGLPLIYVE DGEAHEIKGN RAGLGYHSLV PRKPFVDHRV
PVKPGMSFYQ LTDGIPDHMG GEPRRLLGRR RVRRLIARNA HLPMAEQIQR LEAELERYRG
PEPRRDDMTL VGFRPL