Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1725 |
Symbol | |
ID | 4268974 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 1973826 |
End bp | 1975019 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638126483 |
Product | hypothetical protein |
Protein accession | YP_742561 |
Protein GI | 114320878 |
COG category | [R] General function prediction only |
COG ID | [COG2081] Predicted flavoproteins |
TIGRFAM ID | [TIGR00275] flavoprotein, HI0933 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.685186 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTCTC GCTCCCCATA CCCGCATTAC GATGTCATCG TGATCGGCGC CGGCGCCGCC GGGCTGATGT GCGCGCTCAC CGCCGGCGGG CGTGGCCGCC GGGTGCTGGT GCTGGACCAT GCCAACAAGG TGGGCAAGAA GATCCTGATG TCCGGCGGTG GGCGGTGCAA TTTCACCAAC ATCCACTGCG GGCCCGAGCA CTTTCTGTCG GCCAACCCGC ACTTCGTCAA ATCGGCCCTC AGTCGCTACA CCCCCTGGCA CTTCATCGCC TTGGTGGAAC AGCACGGCAT CCCCTACCAC GAAAAGAAGC TGGGCCAGCT CTTCTGCGAC CGCTCCTCCA AGGATATCGT CGGCATGCTG CTGGCGGAGT GCCGGGCGGT GGGGGTGGCG ATCCGTACCC GCTCGCCGGT CAGCGACCTG CGGCTGGGGG CACCGCACTG GCTGTCCACT CCGCAGGGCC CGGTGACGTG TTCATCGCTG GTGATTGCCA CCGGCGGCTA CTCGATTCCG AAGATGGGTG CCACCGGCTT CGGCTTCGAC CTGGCGCGGT CGCTGGGGAT TCCGGTACGG CCCACCCGTG CGGCGCTGGT GCCGGTGACC CTGGAGGGGC GCAAGCGGCG CCAGCTGCAG GACCTGGCCG GGGTGGCCCT GGACAGCGTC ACCCGCGCCG GCGGGGCCGC CTTCCGCGAG AATATCCTGT TCACCCACCG TGGGCTCAGT GGGCCGGCGA TCCTCCAGGC CTCGTCCTAC TGGCAGCCCG GCGAGCCGCT GGAGATCGAC CTGTTTCCCG ACACGGATCT GGCCGGGCAC CTGGAGGCGA TGCGCCGGGA GCGCCCGCGT CTGACCCTGA AGAAGCTGCT GGGCGAGCAA CTCACCCGCC GTGTGGCCCA GCGCTGGTGT GAACTCTGGC TGCCGGACAG GCGCCTGGAG CAGTTGACCG GCGAGGACAT ACGCCGGATC CAGCAGGCCT GCCAGCCCTG GACGGTCTGG CCCGATGGCA CCGAAGGGTA CCGTACCGCC GAGGTGACCC TGGGCGGGGT CGACACCCAT GCGCTATCTT CCAAGACCAT GGCCTGTCGT GATCACCCGG GGCTCTACTT CATCGGTGAG GTGGTGGATG TCACGGGCCA CCTGGGTGGT CATAACTTTC AGTGGGCCTG GGCGTCGGGG CATGCGGCGG GGCAGCATGT GTAG
|
Protein sequence | MASRSPYPHY DVIVIGAGAA GLMCALTAGG RGRRVLVLDH ANKVGKKILM SGGGRCNFTN IHCGPEHFLS ANPHFVKSAL SRYTPWHFIA LVEQHGIPYH EKKLGQLFCD RSSKDIVGML LAECRAVGVA IRTRSPVSDL RLGAPHWLST PQGPVTCSSL VIATGGYSIP KMGATGFGFD LARSLGIPVR PTRAALVPVT LEGRKRRQLQ DLAGVALDSV TRAGGAAFRE NILFTHRGLS GPAILQASSY WQPGEPLEID LFPDTDLAGH LEAMRRERPR LTLKKLLGEQ LTRRVAQRWC ELWLPDRRLE QLTGEDIRRI QQACQPWTVW PDGTEGYRTA EVTLGGVDTH ALSSKTMACR DHPGLYFIGE VVDVTGHLGG HNFQWAWASG HAAGQHV
|
| |