Gene Information |
Locus tag | Mlg_1139 |
Symbol | |
ID | 4269634 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 1332466 |
End bp | 1333476 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638125888 |
Product | hypothetical protein |
Protein accession | YP_741978 |
Protein GI | 114320295 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
|
|
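The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the terminal stop codon is counted in Gene Length; the function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(1332466, 1333476)
print(length, protein_length(length))
```

Under these assumptions the values agree with the Gene Length (1011 bp) and Protein Length (336 aa) fields of the record.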
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.882872 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTGCA CAAGAACAAT GGTTTTCCTC TTGATCGTTG GCTTATGGGC CGGGACGGCT CAAGCCAACA AAGGCGAAGA CACAACATCC CCAGAGCTCG ATGCCGAGCA ACAACAGGCC AAGGAAGAAG GCATGCGCCT GTACGGCATC CGCTGGCGCA GCGTGGCCAT CCACCACCTG GAAAAGGCCG CCGAGGCCGG CGATGTTGAG TCCATGTACA CCCTTGGCGA GATCTACCGC TTTATGGACC GTGGCATGTC CCACGAGGCC ATCGACTGGT ACCACCGCGC GGCGGAGGGC GGGGATCCCT ACGCCATGCT TCGTCTGAAT TGGGGCATGA TCTGCGAGCT GGCCGACATC TGCCCCGAAG AGCATGACAC CTGGGCAGAA ATGGCCCTGG GCCAGGAACT CCCCAAAGCC GAGGAAGGGG ATCCGGATGC CATGCTTGCA CTGTATTCGA TCTATGTTGC GCTGGAGGAG GTGGAAGAGG GTCGGAACTG GTTACGCAAC GCTGCCAGGG CGGGCCTCCC ACAGGCACAA GACCTGTGGG CGAGTCGTAT TCAGGAGCGC TCCGGCGAAT GGCCCCCGCC GCTGGAAGAC GTCAAGGCCG CCGAGCCCTG GTTCCGCAAG GCCGCCGAGC AGGGCTACGC CCCGGGGATG TACAACCTGT CCCTAGCCTT GCGGGATCAG GAGCGGTATA ACGAAGACTG GAAATGGACG AAAAAAAGTT CCCGACATGG CCATATCAGC GGTCGTCTCG CCGTTGGCTG GTGCTACCTG GATAATACCT GGGCGGATTT CTGCCCGGAC GACGCAGATG ACACGGTCAA GGGTTGGGCC ATACTTCACG CGGTTTATGA AGAGACGCGA GATAGCACGG CCGAGGGCAT TCTTGGGCGA GAACGCGACC GCATGACCGA AGATGAAATC GCCGAAGCCG AAGAACTCGC CGAGGAGTGG CTGAACCGCG AGCCCCCGCT GTCCTACTTC CCGCCCAAGT ACGGCCTGTA G
|
Protein sequence | MPCTRTMVFL LIVGLWAGTA QANKGEDTTS PELDAEQQQA KEEGMRLYGI RWRSVAIHHL EKAAEAGDVE SMYTLGEIYR FMDRGMSHEA IDWYHRAAEG GDPYAMLRLN WGMICELADI CPEEHDTWAE MALGQELPKA EEGDPDAMLA LYSIYVALEE VEEGRNWLRN AARAGLPQAQ DLWASRIQER SGEWPPPLED VKAAEPWFRK AAEQGYAPGM YNLSLALRDQ ERYNEDWKWT KKSSRHGHIS GRLAVGWCYL DNTWADFCPD DADDTVKGWA ILHAVYEETR DSTAEGILGR ERDRMTEDEI AEAEELAEEW LNREPPLSYF PPKYGL
|
| |
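The sequences above are shown in space-separated blocks of ten characters. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked against the protein sequence. It assumes the gene sequence is given on the coding strand, 5' to 3'. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the standard codon layout is used here. All function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order (also valid for table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks and the final blocks of the gene sequence above.
print(translate("ATGCCCTGCA CAAGAACAAT GGTTTTCCTC"))
print(translate("CCGCCCAAGT ACGGCCTGTA G"))
```

The two fragments translate to the first ten and last six residues of the protein sequence in the record. Passing the full gene sequence to `gc_content` gives a value to compare with the GC content field.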