Gene Information |
Locus tag | Mlg_1720 |
Symbol | |
ID | 4268969 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1967321 |
End bp | 1968619 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638126478 |
Product | hypothetical protein |
Protein accession | YP_742556 |
Protein GI | 114320873 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0439] Biotin carboxylase |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.26756 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.377885 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCAGC ACATCTTCGT AATCGGGCTG GACGACTTCA ACCTCGCTGA ACTGCAGACC GTCCGCAACG CCGGGGAGTA CACCTTCCAC GGCCTGGTGG ACTACGACAC CATGGTGTTG CCCGAGTCCT ACCCGATGCC GGAGATCATG GCCGAGGCCC GCCGGACCCT CGCGGATGCC CCCGCGGTGG ACGGCATCAT CGGCCACTGG GACTTTCCCA CCACCTCCAT GCTGCCCATC CTGCGCCGGG AGCACGGCCT ACCCACACCC ACCCTGGAGA GCGTGCTCTA CTGTGAGAGC AAGTACTGGA ATCGACTCGC CTGCGAGCAG GCGGTGCCCG AGTGCACGCC CGACTTCCAG GGGCTGGACC CGTACAGCGA CGACCCGCTG GCCGACCTGG ACGTGGCCTA CCCCTTCTGG CTCAAGCCCA CCGTGGCCTT CTCCTCCTAC CTGGGCTTCC GCATTGAGAA CGAGCAGCAG TACCTGGACG CCATGGCCAC CATTCGCGAG CACATTCACG TGTTCGCCGA ACCGTTCGAC TACATCGTCG AGCAGTGCCA GAACCGCGCC GCCCTGCCCG ACCGCGGCAG CGGCGCCACC TGCATCGCCG AGGGGCTGAT CGGTGGTCGG CTCTGCACCC TGGAGGGGTA CGTGCACAAC GGCGAGGTGG TGGTGTATGC GGTGGTGGAC TCGCTGCGGG CCGCCAATAA CGTGAGCTTT TTCAGCTACC AGTACCCCTC CCAGCTCCCC GCCGGGGTCC GCAACCGCAT GATCGGACAC GCGGAAAAGC TCCTTCACCA CATCGGGTTG GACCACACGC CGTTCAACAT GGAGTTCTTC TGGGATGAGG CCATCGACAA GATCTGGCTG CTGGAGATCA ACGCGCGGAT CTCCAAGTCC CACTGCCCGA TCTTCCAGAT CGCCACCGGG GCCTCCCACC ACGAGGTGGC CATCGACATC GCCCTGGGCC GGCGGCCGGA CTTCCCCCGT CCGGAGGGCC GTTTCCCCAT GGCGGGCAAG TTCATGCCCC GCGTGTTCGC CGACACAGTG GTGACCCGGG TGCCCTCCGA GGAGGAGATC CAGGCGCTCA AGCGGGTCCA TCCGGAACTG ATCGTCCACA TCGCCATAGA GGAAGGGATG CGGTTGTCGG AACTGCGCGC CCAGGACAGC TACAGCTTTG AGATCGGCGA TGTCTTCCTG GGGGCGGCGG ACGAAGCCGA ATTGCACCAG AAGTTCCGCC ACATCATGCA GGCCCTGGAC TTCCAGTTCG CTGACGTGGT GCCGACCAAC TACAGCTGA
|
Protein sequence | MTQHIFVIGL DDFNLAELQT VRNAGEYTFH GLVDYDTMVL PESYPMPEIM AEARRTLADA PAVDGIIGHW DFPTTSMLPI LRREHGLPTP TLESVLYCES KYWNRLACEQ AVPECTPDFQ GLDPYSDDPL ADLDVAYPFW LKPTVAFSSY LGFRIENEQQ YLDAMATIRE HIHVFAEPFD YIVEQCQNRA ALPDRGSGAT CIAEGLIGGR LCTLEGYVHN GEVVVYAVVD SLRAANNVSF FSYQYPSQLP AGVRNRMIGH AEKLLHHIGL DHTPFNMEFF WDEAIDKIWL LEINARISKS HCPIFQIATG ASHHEVAIDI ALGRRPDFPR PEGRFPMAGK FMPRVFADTV VTRVPSEEEI QALKRVHPEL IVHIAIEEGM RLSELRAQDS YSFEIGDVFL GAADEAELHQ KFRHIMQALD FQFADVVPTN YS
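
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence above ends in TGA). The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced here for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record above.
print(gene_length(1967321, 1968619))  # 1299
print(protein_length(1299))           # 432
```

Both results agree with the Gene Length (1299 bp) and Protein Length (432 aa) fields. `gc_percent` can be applied to the gene sequence as printed, since the spaces between blocks are stripped before counting.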