Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0907 |
Symbol | |
ID | 4269292 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1025247 |
End bp | 1026548 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638125659 |
Product | hypothetical protein |
Protein accession | YP_741751 |
Protein GI | 114320068 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0439] Biotin carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.68427 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGCGC ACAAGAACAT CTTCGTATTC GGGCTGGACG AGGAGAACCA CGCGTTTCTC AAGCGCGTCC GGCACGCGGA GCACTATACG TTCCACGGCC TGCTGAACCG CTCCGAACTG GTGGAGCGGG ACGATTACGA CATCCCGCAC CTCATCGAGA AGGTGCGGAC CCGGCTGAAT GGCTTCGATG GGTCCGTGGA CGGCCTGATC CACTACATCG ATTTTCCGGT GAGCACCACG GTACCCCTGC TGGCCCGGGA GTACGGGCTG CCTTCGGCGT CCCTGGAGGC GGTGCTCTGC TGCGAGCACA AGTACTGGGC GCGGGTGGAG CAGGCCAGGG TGATCCCCGA TGCCTGCCCG CCCTTTGCCG CCTTCGACCC CTTCGACGAC CGTGCCCGGG CGCGGCTCGA GGCGGAGATC GGCTACCCGT TCTGGGTCAA GCCCATCAAG TCCTTCTCCA GCTACCTGGG TTTCCGTATT GACGGACCGG AGGACTTCGA GCACGCCCAG GCGCGGATGC GGGCGGGAAT CGGTCGCTTC GCCGAGCCCT TCGACTACCT GCTGGACCAG GTGGAACTCC CCCCGGAGGT GCGCGGCATC GGCGGCGGCC ACTGCCTGGC CGAGGGCATC ATTGGTGGCC ATCAGTGCAC CCTGGAGGGC TATGGCTACC AGGGCCATGT GTATGTCTAC GGTGCGGTGG ACTCGGTCCG CGAGCCCAAC GGCTCTTCAT TCCGGCGCTA CCAATACCCG TCGGTGCTGC CCGAGCCGGT CCAGCAACGG ATGATCGAGC AGGCGCGGCG GTTCATGACC CACATCGGCT ACGACAATGC GCCCTTCAAC ATTGAGTTCT ACTGGGACGA AGCCACAGAC GATGTCTGGC TGTTGGAGGT CAACACCCGG CTGTCCCAGT CCCACTGCGA CCTGTTCGAG AAGGTGGACG GGGTCTCGCA CCAGGAGGTG GCCGTTGACC TGGCGCTGGG TCGGGCGCCG GAGTTCCCGC AGGGGCGGGG CGAGTTCCCC ATGGCGGCGA AGTGCTTCCT GCGGGTGTTC GAGGACGGCA AGGTCACCCG GGTGCCCAGT GCATCGGAGG TGCGCGCTTT GGAGGAGGCC TTCCCCGGGA CCCGTATCCA GATCCAGGCT CGCGAGGGTG CCTGGCTGTC CCAGCTCTGG GATCAGGACA GTTACAGTTA CTGCCTGGCG CTGATCTTCC TGGGCGGCGA GAATGAAGAG GACATAGAGG CCCGTTTCGA ACACATCCGC GAGGGCCTGG ACTTTCGGAT CGAGAAGCCG GAGGCAGCGT GA
|
Protein sequence | MVAHKNIFVF GLDEENHAFL KRVRHAEHYT FHGLLNRSEL VERDDYDIPH LIEKVRTRLN GFDGSVDGLI HYIDFPVSTT VPLLAREYGL PSASLEAVLC CEHKYWARVE QARVIPDACP PFAAFDPFDD RARARLEAEI GYPFWVKPIK SFSSYLGFRI DGPEDFEHAQ ARMRAGIGRF AEPFDYLLDQ VELPPEVRGI GGGHCLAEGI IGGHQCTLEG YGYQGHVYVY GAVDSVREPN GSSFRRYQYP SVLPEPVQQR MIEQARRFMT HIGYDNAPFN IEFYWDEATD DVWLLEVNTR LSQSHCDLFE KVDGVSHQEV AVDLALGRAP EFPQGRGEFP MAAKCFLRVF EDGKVTRVPS ASEVRALEEA FPGTRIQIQA REGAWLSQLW DQDSYSYCLA LIFLGGENEE DIEARFEHIR EGLDFRIEKP EAA
|
| |