Gene Mlg_0041 details

Gene Information

Locus tag: Mlg_0041
Symbol:
ID: 4270910
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 43278
End bp: 46577
Gene Length: 3300 bp
Protein Length: 1099 aa
Translation table: 11
GC content: 67%
IMG OID: 638124767
Product: hypothetical protein
Protein accession: YP_740889
Protein GI: 114319206
COG category:
COG ID:
TIGRFAM ID:
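
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic; it assumes the coordinates are 1-based and inclusive and that the gene span includes the terminal stop codon. Both assumptions are consistent with the values in this record, but neither is stated by it. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 43278
end_bp = 46577

# Assumption: 1-based, inclusive coordinates, so both endpoints are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the span ends with a stop codon, which encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.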


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACCG ACATGGAAGC CGCCAACCGC GTCGCGGAGA GCCGGAACGA TAACGACCCG 
GGCCAAGAGC AGGGGCTCTG CCCTCTTACG CGCGGTGAGC TGCAGCTGGT CCCGGTGCGG
TATGCCCTGG TCGAGGCGCC CGAGGCGGAT CGGGCCTCGG GCACGCCGGG CTTCCGGCCC
GTTCATGACG GCAGTTTCCG TCGCTGTGGC GTACGGCCGG TACGGGAAGG CTGGCTCTAC
CTGGTACACA GCTCAACGCC CGATGAGCTG CAGGTCTTCG AGGTCAAACC CGATGGCAGC
GGCGATGCGA TCATCGTCGA GCGCGAGGGC AGCATCCAGG TGCTGTTCAG TCCCTTGGCG
CTGACGGCCG TGCACCAATC GATGCTGCTT AAACCTGCCT TCCGCGATCA GGTGATGACG
CGGGTCAACG TTGGCGCCTA CTGTCCGGGG GCGGGTACCG CTCACCTGCT CGATCCCGAT
GCCCTGGCCG ACGTCCTGGC GGACGACCAT GGCGAACACC GCGCGACCCC CAGCGCCCCG
ACCGAGCAGG ACGGCGACCC GCTCGACCCG GGCAGCTACG CCTGGTGCGA CGCGGAAGGC
GAACACGCCG AGTGGCAGCG TGCGCGCGCC GCCGAGATCA AGGCGGCTAT CCAGGGGGGC
TTTCAGCAAG ACAGCGCCTG CCTGGTAGTG GACGATATCG CCGGACGTAT CAAGGACCTC
GCCCAGGCGT GGGCCTGCCT GGCGGAACAA CAGGGCCAGT GGGTCGACGA GAACGCGGTG
GCGCTCTTCT CGGCACGAAC CATCGAAGGC CTGATGTCGT TGAACCTGGC GCCTCACATT
GCCAATGCAG GCGACGGCAA TATTCCTGAG TGGCTGAGCG AAGCCAGTTA TGCCGAGCGT
AACGACCTGG AGTCGCTCGC CGAGCTCTAT TCGCAGCATC GCGAAGCCAT AGAGCAGGTC
ACTGCCCGTT CAGGCGGCCA CCCGGGCTTC ATTGGCGGGG CGCTGGCGCC CAACGGCATC
GTCGACGTGA TCCGCCGGCA GGCGATGGAA GACTTCCTGG CCTATGCCAC CCCGATGCAG
GCGCAGTGGC AGAGCGAGTA TCACCGTATC GGGGCCGACC TGGCCGCCAT ACTGCCCACC
TGGCACACCC AGGCCCTGCT GCTCGACCGC GAAGAGGAGC ATCACATCCT GCTGACCTGC
CTGCTGGAAA AGCAGGCCGT CGAGACCCTG CTGGCCTGCG GGCAGGAGGA CTTCCTGTCG
AGCTACTACG CCGGCGACGA CCCGGTACCG GCACACCTGA TGCATTACGT CCCCACCGCG
TCCTTCGTGG AGGGATTCCT TTCCCAAAAC ACTGGCCTCC AGAAAGCGCT CACCCAGGCG
TCCGCACTGA TGGGCGCCCA GGGCGCACTC AGCCGCTACC AGCAGTGGCG GGGCGAGGTT
GAACATCAGA CCGGGCTGCG CTTTCGCAGC GTCGAGGGCC TCTCCGACGA AGCCCGAACC
GCCATCGCGG GCGAAGTGCA AATCAAGGAG AAGCTGCTGG GGCAGGCGGT GCTCGGGACG
TTGCTCGATG ACGTACAGGA TGTCGACCTG GGGCAGCGCA TCACTACCCT GGCTTCGCGC
TTGCCCGATG GGCAACGGCT GATGTTTGCC GAACGTCTGG GGCTGCTCGA GCTCGGCTGG
GCCATCCCCG ACCGCTCCGT ACTGGGCCGC ATCCAGCAAG CACTGGACCA GGCGGACTCG
GCCGTGGCCA ACCTGGCAAC CCTTGAGCGC AGGCTCGAGC AGCTCTACCA GGAGCGCCAG
GTCGAGTTGG CACGGGCGTC CAGGCGCGGC ACCAGCCAGG CGCATCGCCG GGCCGCCGAC
CGGTTCAATG CCCGCAAAAT CCGCGAAAGC CAGGCCGACA TCAACCGGCA CAGGCGCCTG
CTGGGCGAGG CATTCGACGC CCTGGCCGAA CACAGCTTCC CCGTCGACAA GGCCAATGGC
CACGCCGTCC AGGTCGGCGG CCTGAGCCTG GCCGCCACCC GTGCCGCCCT GGCCGAGCGT
GCCGCCGACC GGGCCGTTGC CCGTCAACCA GCCGCGACCC TGAACGACAC CTTGAACACC
CTGGTGCGCG ACGAACAGGG CGAGCTGTCG GCGAGTCGGA GCCTGGGGGT GCTGGTTAAT
GGGAGCCTGT CGATGCTGGG TATGATCATG ACCGGCGTGG CCCTGCGCAA TACGCTTGAT
GCCTGGGGCG AAGAACGCCT CATGGAGCAC GCCTTCGCGA CGGGCTCTCA CGCGACAGGG
GCGGCAGCGA GCATCATGGC AATACGAGAA ATGATCATCG ACGCCCGCCA TCGCAACCTC
TACCAGGGGC GGGGCTTTCA GCAGGTGGCC TTGGCCGAGA CCCGCGCCGC CGCCGGAAGC
CCCGCGCAAC TGGAGCGCTG GGCAAGGGTT GCCAATGGGG CGATGGGGGC CGTGGCCCTA
CTGGGTGGCT TTGCAGGTCT CTTAGAAACC TACAAGCAGT ACAAACGAAT GCAGGGATCA
GAAACCCAGG CGGAGCGACT GGCCTTACAG GTTGCCTTTA CAGGTGCGGC CGGAGTCGCC
GGTGGTGGCT TCTTAATCGG CGGCATGAGC GCGCTTGGCA GGATGCTGGG CAAGCCCGCA
GTCGCCTGGC GACTGCTGCT GCTGAAATTC GCCGGCCCCG CCGGCTGGGT GGTGGCGGTT
GGCACCGCCC TGCTGATCAT CGGCGAGGTG CTGGCCAATC GCTTCTCGTT GAGCCCTGTG
CAGCGCTGGT GCCAGCGCAG TCACTGGGGG CGAGAAGATC AGGGCTGGGA TCGCGAGGCC
CACGAGCGGG AACTGGCCCG ACTTGGCGAT ACCGATCTCA CGGTGGAACG GCAGGGGCAG
GCCGAGCCCC ATGGCGGCCC GGGGCCCGGG CCGGCAGGCA CCGACCTCGC CATACGCATT
GGCTTGCCCG GGCTTGACGC CCCCAATGCG GAAAACCTCG CGCTGGGCCT CTGGGGCGTC
ACCCCTCGCC TCAAGGAAAT GACCCGAGAC TTTCTCGAAC ATGCCGAGCT CGAAAACCAG
GGCTCGAGCT ATGCCCTGCA CTACCATTTC GATCCCGAAA CATTGGCCGA ATGCCACGAG
TTCCGCCTCG TCATCCGCAC GAAGGGCCCC GAAGCATCCA CCACCCGGGT CTTCCAGTTG
CATCGCCGCG GCACATCGCT CTCCGATGAG TGGAGGGAGA TCTCCGCCCT CGGCGATCGT
TTCCTCACGC GGTACCAAGT GGGCAACTGG CCGGACATGC CCCTGACGCC CTGGCCGTGA
 
Protein sequence
MSTDMEAANR VAESRNDNDP GQEQGLCPLT RGELQLVPVR YALVEAPEAD RASGTPGFRP 
VHDGSFRRCG VRPVREGWLY LVHSSTPDEL QVFEVKPDGS GDAIIVEREG SIQVLFSPLA
LTAVHQSMLL KPAFRDQVMT RVNVGAYCPG AGTAHLLDPD ALADVLADDH GEHRATPSAP
TEQDGDPLDP GSYAWCDAEG EHAEWQRARA AEIKAAIQGG FQQDSACLVV DDIAGRIKDL
AQAWACLAEQ QGQWVDENAV ALFSARTIEG LMSLNLAPHI ANAGDGNIPE WLSEASYAER
NDLESLAELY SQHREAIEQV TARSGGHPGF IGGALAPNGI VDVIRRQAME DFLAYATPMQ
AQWQSEYHRI GADLAAILPT WHTQALLLDR EEEHHILLTC LLEKQAVETL LACGQEDFLS
SYYAGDDPVP AHLMHYVPTA SFVEGFLSQN TGLQKALTQA SALMGAQGAL SRYQQWRGEV
EHQTGLRFRS VEGLSDEART AIAGEVQIKE KLLGQAVLGT LLDDVQDVDL GQRITTLASR
LPDGQRLMFA ERLGLLELGW AIPDRSVLGR IQQALDQADS AVANLATLER RLEQLYQERQ
VELARASRRG TSQAHRRAAD RFNARKIRES QADINRHRRL LGEAFDALAE HSFPVDKANG
HAVQVGGLSL AATRAALAER AADRAVARQP AATLNDTLNT LVRDEQGELS ASRSLGVLVN
GSLSMLGMIM TGVALRNTLD AWGEERLMEH AFATGSHATG AAASIMAIRE MIIDARHRNL
YQGRGFQQVA LAETRAAAGS PAQLERWARV ANGAMGAVAL LGGFAGLLET YKQYKRMQGS
ETQAERLALQ VAFTGAAGVA GGGFLIGGMS ALGRMLGKPA VAWRLLLLKF AGPAGWVVAV
GTALLIIGEV LANRFSLSPV QRWCQRSHWG REDQGWDREA HERELARLGD TDLTVERQGQ
AEPHGGPGPG PAGTDLAIRI GLPGLDAPNA ENLALGLWGV TPRLKEMTRD FLEHAELENQ
GSSYALHYHF DPETLAECHE FRLVIRTKGP EASTTRVFQL HRRGTSLSDE WREISALGDR
FLTRYQVGNW PDMPLTPWP
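
The protein sequence can be cross-checked against the gene sequence by removing the 10-character grouping and translating codon by codon. The following is a minimal sketch, not the database's own pipeline: the helper names `clean` and `translate` are hypothetical, and the codon table is the standard one, which shares its codon-to-amino-acid assignments with translation table 11 (the two differ in which start codons are permitted; this gene begins with ATG). Only the first and last lines of the gene sequence are used here for brevity.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence, copied from this record.
# Each line holds 60 bases, so both are in frame with the full gene.
first_line = "ATGAGCACCG ACATGGAAGC CGCCAACCGC GTCGCGGAGA GCCGGAACGA TAACGACCCG"
last_line = "TTCCTCACGC GGTACCAAGT GGGCAACTGG CCGGACATGC CCCTGACGCC CTGGCCGTGA"

print(translate(first_line))  # MSTDMEAANRVAESRNDNDP
print(translate(last_line))   # FLTRYQVGNWPDMPLTPWP
```

The two outputs match the first 20 and last 19 residues of the protein sequence above. Passing the full gene sequence to `translate` would allow the same comparison over all 1099 residues.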