Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_0138 |
Symbol | |
ID | 8409635 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 138809 |
End bp | 141622 |
Gene Length | 2814 bp |
Protein Length | 937 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645018463 |
Product | FG-GAP repeat protein |
Protein accession | YP_003175983 |
Protein GI | 257386210 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.247842 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAACA CTTCCCAGAT CCGACTGCGG ATCGGACGGG CGACGGTCGT CGTCTTGATC GCCGCGGTCG TCGCGGCGAG TCCCATCGTC GCGAGCGTGA CAGCACTCGG TGACGACGCG AGCATCTCAC AGCGAACGGA CGATATCGTC GCCCAGCAGT CAGCGCCCAG TGGGTTCTCC GGCGAGACGA ACCTCTCGGA GGCGGAAACG AAGTACGTCG GCACTGCCGA AAACGATACC GCGGGCTGGT CGGTCGCGAA CGCGGGCGAC GTGAACGGTG ACGGAATCGA CGATCTCGTC GTCGGCGCAC CGGAGAACGA CACCGGCGGG ACCAACGCGG GGGCGGCCTA CGTCTTCTTC GGCCCGGCCG ATCCTGGCAC GGTCTCGCTG GCCGACGCCG ACGTGACCCT CGTCGGCGCG GCGGCCTCCG ACCGCGCTGG CTACGACGTG TCGTACGCGG GTGACGTGAA CGACGACGGC TACGCAGACG TGATCGTGGG CGCACCGGGC AACGACAGCA CGGCCAGCAA CGCTGGCGCG GCCTACGTCG TCTACGGCGG CGACACGATG GCCGACCGGA TAAGTCTCGC CGACGCGGAC GTGACGCTGA TCGGCGACTC GCCGGGCGAC CGTGCCGGGT GGTCGGTCTC GAACGCCAGC GGACTCGACG GTCCCGACGG GGTCGCCGTC GGCGCACCCT TCGCCAACGA CAGCGCCGGT GGGGCCTACC TCGTCTCCGG CGAGCAGCTG TTTGGCACCG TCGACCTCGG GGCGGAGTCG ACCGCTACGC TGACCGGCGA GTCGCCGGGC GACCAGGCCG GCTGGTCGAT CTCGCACGCT GGAGACGTGA ACGCCGACGG TACGGCCGAC GTGATCGTCG GTGCGAACAA CTACACGGCC GCCGACGGAC CGGCCGGGAG CGGGGCTGCA TACGTCGTCT ACGGCGCGGT CGGCGGCGAG CGGGATCTCG GCGACGCCGA CCTGCGACTC CGTGGCGTCG ACGGTGCGGA CCGCGCGGGC TGGTCGGTCT CGTACGCGGG TGACGTGAAC AACGACAGCA CTGCCGACGT GATCGTCGGC GCACCGTTTA CCGATCCCAA CGGAACGGTC GCGGCGGGAT CGGCGTACGT CGTCTACGGC GAGCCAGACA GGTCTGGCGA CGTGTCGCTG GCCGACGCCG ACGTGCGTCT GACCGGCGAA GGTGACCGCG ACCGGGCGGG CGTTGCGGTC TCGTCGGCTG GCTCCGGTGA CGTGACCTGT GACGGTGTCG ACGACGTGCT CGTCGGCGCG CCGCAGAACG ACTCGAACGG GAACGCCTCC GGAGCGGCCT ACGTCGTCGC CGGCAGCGAA TCGTTCTCGG GTAACATCTC GCTGAGTGAC GCCGACGCGA TCTTCCGCGG CGAAGCGGCC GGCGATCGAG CGGGCCGTGC GGTCGACGAC GTTGGTGATC TCGACGACGA CAGCTTCGAC GACATCGCGG TTGGTGCGCC ACGGAACGAC AGTAGCGCAA CTGACGCCGG AGCGGCCTAC GTGCTGAACA GCGACTGCGC AGTGCTCGAA ACGCCGACTG CGACGCCGAC CGAAACTCCG ACCGATACGC CGACTGACAC CCCGACTGAC ACTCCGACTG ACACCCCGAC TGACACGCCG ACGGATACCC CCACGGATAC GCCAACCGAC ACTCCGACCG ATACGCCAAC TGACACGCCG ACCGACACCC CGACTGACAC GCCAACCGAC ACCCCGACTG ACACTCCGAC CGACACGCCA ACCGACACCC CGACTGACAC TCCGACCGAC ACGCCAACCG ACACCCCGAC TGACACGCCG ACCGACACTC CGACCGATAC GCCAACTGAC ACGCCGACCG ACACTCCGAC CGATACGCCA ACTGACACGC CGACCGACAC TCCGACGGAC ACCCCGACGG ATACCCCCAC GGATACGCCA ACCGACACCC CCACTGACAC GCCGACGGAC ACGCCAACCG ATACGCCGAC GGACACCCCC ACCGACACGC CGACCGACAC TCCGACTGAC ACGCCGACCG ACACTCCGAC TGACACGCCA ACCGACACGC CGACTGACAC TCCGACTGAC ACGCCGACCG ATACCCCGAC CGATACGCCG ACCGATACGC CAACTGACAC GCCAACCGAC ACCCCGACTG ACACTCCGAC CGACACGCCG ACTGACACGC CAACTGACAC GCCGACCGAC ACGCCAACCG ATACGCCGAC GGACACCCCT ACCGACACGC CGACCGACAC TCCGACTGAC ACCCCGACGG ATACCCCCAC GGATACGCCG ACCGACACTC CGACTGACAC TCCGACCGAC ACTCCGACTG ACACGCCGAC TGACACGCCA ACCGACACGC CACAGAATCT CGCGGCGATC AGCTTCGTCG CGTTCTGTGT CCCGGGCGAA CAGGGATCGG GCAACGATCC GTGTCCCGAA GGCGAGCGGC TCCTGGTCAA ATTCGAGGAC CAGGGCGACG GGTCCTTCGC GCCAGAGGGC GGTGACGCGA TGGGCGTGAC TGTGACCCCA TCCGAGTTCA AGGACAACGA TCCGTCGGAA GTCGTCGCCG TCCAGTGGAC CTCCGGACAG TCGATCTCGA CGGTCGTCGT CAAGTCCTCG ACTGACGAGT GTAACTACCC CGGTGGGAGT TCGGGCACCG CAGAATCCTG CGGACCGCCG TCGGGCCAGA GTTCCCAGTC GGAACCCGGT GGCGGTGGCT CGGGGCCACT GCCGCCGATC TTCCTCGCCG GACTGGCAGC GACTTCGCTG GTGGCTGTCG GGCGGCGCGA CTGA
|
Protein sequence | MSNTSQIRLR IGRATVVVLI AAVVAASPIV ASVTALGDDA SISQRTDDIV AQQSAPSGFS GETNLSEAET KYVGTAENDT AGWSVANAGD VNGDGIDDLV VGAPENDTGG TNAGAAYVFF GPADPGTVSL ADADVTLVGA AASDRAGYDV SYAGDVNDDG YADVIVGAPG NDSTASNAGA AYVVYGGDTM ADRISLADAD VTLIGDSPGD RAGWSVSNAS GLDGPDGVAV GAPFANDSAG GAYLVSGEQL FGTVDLGAES TATLTGESPG DQAGWSISHA GDVNADGTAD VIVGANNYTA ADGPAGSGAA YVVYGAVGGE RDLGDADLRL RGVDGADRAG WSVSYAGDVN NDSTADVIVG APFTDPNGTV AAGSAYVVYG EPDRSGDVSL ADADVRLTGE GDRDRAGVAV SSAGSGDVTC DGVDDVLVGA PQNDSNGNAS GAAYVVAGSE SFSGNISLSD ADAIFRGEAA GDRAGRAVDD VGDLDDDSFD DIAVGAPRND SSATDAGAAY VLNSDCAVLE TPTATPTETP TDTPTDTPTD TPTDTPTDTP TDTPTDTPTD TPTDTPTDTP TDTPTDTPTD TPTDTPTDTP TDTPTDTPTD TPTDTPTDTP TDTPTDTPTD TPTDTPTDTP TDTPTDTPTD TPTDTPTDTP TDTPTDTPTD TPTDTPTDTP TDTPTDTPTD TPTDTPTDTP TDTPTDTPTD TPTDTPTDTP TDTPTDTPTD TPTDTPTDTP TDTPTDTPTD TPTDTPTDTP TDTPTDTPTD TPTDTPTDTP TDTPTDTPTD TPTDTPTDTP TDTPQNLAAI SFVAFCVPGE QGSGNDPCPE GERLLVKFED QGDGSFAPEG GDAMGVTVTP SEFKDNDPSE VVAVQWTSGQ SISTVVVKSS TDECNYPGGS SGTAESCGPP SGQSSQSEPG GGGSGPLPPI FLAGLAATSL VAVGRRD
|
| |