Gene Mlg_2678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2678 
Symbol 
ID4269553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp3032302 
End bp3033309 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content65% 
IMG OID638127437 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_743508 
Protein GI114321825 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000277993 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000000000308835 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACACC TGATTACCGG CGCGGCCGGC TTCATCGGCT ACCACACCGC CCAGGCGCTG 
CTGGCTCGCG GCGACGAGGT CGTCGGCCTG GACAACCTCA ATGACTATTA CGACCCACGG
CTCAAACGGG CACGCCTGGC CCGGCTCGAG GGGCAGCCGG GCTTCCGCTT CGTCAAGCTG
GATCTGGCCG ACCGGGCCGG AATGGCCGAA CTGTTCCGCG CGGAACGCTT CCAACGGGTG
ATCCACCTGG CCGCCCAGGC GGGCGTGCGC CACTCGCTCA CCGACCCCTA CAGCTATGTA
GACAGCAACG TGAGCGGCAC GCTGAACGTG CTTGAGGGTT GCCGCTACAA CGACGTGGAG
CACCTCACCT ACGCCTCCAC CAGTTCGGTC TACGGGGCCC ACGAGGACAT GCCCTTCACC
GAGCACCGGC ATACCGACCA CCCGCTGGCC ATCTATGCGG CGACGAAGAA GGCCACGGAA
CACATGGCCC ACAGCTACGC CCACCTTTAC GGGCTGCCTT GCACCGGGTT GCGCTTCTTC
ACCGTCTACG GCCCCTGGGG CCGCCCCGAC ATGGCGCTGT TCCTGTTCAC CCGCAAGATC
CTCGCCGGTG AGCCCATCGA CATCTACAAC AACGGCGATC ACGGCCGGGA TTTCACCTAT
GTGGATGACA TTGTCGACGG CGTCATCCGC GCCTCTGACC GGGTGGCCCG CCGCAATCCG
GAGTGGGACC CGAAGCGGCC GGACACGGCC ACATCCAATG CCCCCTGGCG GATCTACAAC
ATCGGCGCCA ACCGTCCGGT CCGCCTGATG CACTACGTCG AGGTGCTGGA GGAGGCCCTG
GGACGCAAGG CGGAGAAAAA CTTCCTGCCG CTGCAACCGG GTGATGTGCC AGAGACCCAC
GCCGATGTCT CGGCGCTGGC CCAGGATACC GGGTATTCAC CCAAGGTGTC GGTGGAGGAG
GGCATCCGCC GCTTCGTCGA CTGGTACCGG GAATACCACC ACGTCTAG
 
Protein sequence
MKHLITGAAG FIGYHTAQAL LARGDEVVGL DNLNDYYDPR LKRARLARLE GQPGFRFVKL 
DLADRAGMAE LFRAERFQRV IHLAAQAGVR HSLTDPYSYV DSNVSGTLNV LEGCRYNDVE
HLTYASTSSV YGAHEDMPFT EHRHTDHPLA IYAATKKATE HMAHSYAHLY GLPCTGLRFF
TVYGPWGRPD MALFLFTRKI LAGEPIDIYN NGDHGRDFTY VDDIVDGVIR ASDRVARRNP
EWDPKRPDTA TSNAPWRIYN IGANRPVRLM HYVEVLEEAL GRKAEKNFLP LQPGDVPETH
ADVSALAQDT GYSPKVSVEE GIRRFVDWYR EYHHV