Gene Information |
Locus tag | Mlg_0934 |
Symbol | |
ID | 4268221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1058971 |
End bp | 1060140 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638125686 |
Product | tetratricopeptide repeat protein |
Protein accession | YP_741778 |
Protein GI | 114320095 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2956] Predicted N-acetylglucosaminyl transferase |
TIGRFAM ID | |
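The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the gene sequence includes the stop codon; both are assumptions, though they agree with the values in this record. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1058971
end_bp = 1060140

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1170, matching "Gene Length | 1170 bp"
print(protein_length)  # 389, matching "Protein Length | 389 aa"
```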
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGAGT TGCTTTGGCT TTTGCTGCCC GTGGCGGCGA TGTCCGGGTG GTTGGCCGGG AGACGGAGCG GGGCCGGGCA TCGGGGCGGC GAGCAACGGG ACCTGCCCGA GGCCTATTTC CAGGGCCTCA ACTACCTCCT GAACGAGGAG CGCGACAAGG CGCTCGAAGT GTTCACCCAA ATGGTGGAGG TGGACAGCGA GACAGTCGAG ACCCACCTGG CGCTGGGCAG CCTGTTCCGG CGCCGGGGTG AGGTCGACCG CGCCATCCGT ATTCACCAGA ACCTCATCGC CCGCCCGGCC CTGAGCCGCC AGCAGCGCAC CTACGCCCTG CTGGAGCTGG GCGAGGACTA CATGCGTGCG GGCCTGCTCG ACCGGGCCGA GACGCTCTTC GAGGAGGTGA TCGACCTCAA CCACCACGTC GAACCGGCCC TGCGTCAACT GCTGGCCATC TATCAGCAGG AGAAGGAGTG GGACCAGGCC ATCGGCGCCG CCCTGCGCCT GGAAAAGGTC TCCGCCCAGA ACCTCCACCC CCAGGTGGCG CACTTCTACT GCGAGATGGC GGGCGAAGCC TGGGCGGCCG GCGATCTCAG CCGTGCCCGC ACCCTCTACA AGCGCGCCCT CACCCACGAC CCGCGGTGTG TCCGGGCGAG TATCCAGGCC GGGCATCTGG CGCGGCAGAT GGGGCATGCC CGGCAGGCGG TACGTTTGTA CCGGCAGGTG CCTACCCAGG CACCGGAGTT CGTCGGCGAG GTGCTCGATG GCCTGTACCA GGCGCTGGAG AGCCTGGGTC AGCTCCACCG CTACCCGGAG TTTCTCGATC AATTGCTCGC CACCGGCAAG GCCCCGGTGG CGGTGGCGCT GGCGAAAGTG GAGTGGCTGC GCGCGGAGGC TGGGCACGAG GCGGCGATGC GCTGGCTGGC CGAGCACCTT GAAGCCCAGC CCTCGGTGCG CGGCCTACTC CGGCTGGTGG AGATGAGCGA CGGCGCCCCC CCTGTGGCGG AGGGTCCGGT GGAGGCGGCA CTGCACCGGA CTCTCCGCGC GCTGCTGGAG GCGCGGGCGC AGTACCTTTG CGGGCAATGC GGCTTCACCG CCCGCACGCT GTTCTGGCAA TGCCCCGGCT GCAAGAGCTG GGGCAGTATC CGCCCCCTGC GTGGCGTGGA GGGAGAGTAA
Protein sequence | MPELLWLLLP VAAMSGWLAG RRSGAGHRGG EQRDLPEAYF QGLNYLLNEE RDKALEVFTQ MVEVDSETVE THLALGSLFR RRGEVDRAIR IHQNLIARPA LSRQQRTYAL LELGEDYMRA GLLDRAETLF EEVIDLNHHV EPALRQLLAI YQQEKEWDQA IGAALRLEKV SAQNLHPQVA HFYCEMAGEA WAAGDLSRAR TLYKRALTHD PRCVRASIQA GHLARQMGHA RQAVRLYRQV PTQAPEFVGE VLDGLYQALE SLGQLHRYPE FLDQLLATGK APVAVALAKV EWLRAEAGHE AAMRWLAEHL EAQPSVRGLL RLVEMSDGAP PVAEGPVEAA LHRTLRALLE ARAQYLCGQC GFTARTLFWQ CPGCKSWGSI RPLRGVEGE
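The two sequences are printed in space-separated blocks of ten characters. As a minimal sketch of how the gene sequence relates to the protein sequence, the code below strips the spaces and translates codon by codon. It assumes the amino acid assignments of translation table 11 are the same as those of the standard genetic code (the two tables differ in permitted start codons, not in assignments). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
# Assumption: table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between ten-character blocks and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from position 0; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence, copied from this record.
head = clean("ATGCCCGAGT TGCTTTGGCT TTTGCTGCCC")
print(translate(head))  # MPELLWLLLP, the first block of the protein sequence

# Last nine bases of the gene sequence: two codons plus the stop codon.
tail = clean("GGAGAGTAA")
print(translate(tail))  # GE*
```

Passing the full gene sequence through `clean` and then `gc_fraction` gives a value that can be compared with the GC content field of the record.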