Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1143 |
Symbol | |
ID | 4269638 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 1337415 |
End bp | 1339508 |
Gene Length | 2094 bp |
Protein Length | 697 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 638125892 |
Product | hypothetical protein |
Protein accession | YP_741982 |
Protein GI | 114320299 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACT CCATCGCCCC CCAGGTCTGG CCCGCGGGAC CGCGCGGCCG CGACGATGAC CTCTTCCACG AAGACCCGCG CCCCCTGTGC GCAACGTTCC TGTGGGCGGA CATCGTCTAT CTCACCGGCT CCGGCGAGTT CTGGCTGCTC AACGCCACAG CCGCCGCCGC GATGCACCAC GCGGCCGACA AGCTGGCCGA CATCGCCGCC GTGGACGACC GCGACGAGCG CAACCGGCGG CTGTCCGAGG AGGCGGGCGT GCTGGACAGC TTCCTGCCAG CGCATCCGGT CAGTTTCCTG GGGGAAGCGG ACCGCCAACG GTTCGCGCAG ACCCTCCAGC AGCTCGCCGC CCTTCAGGAC GAGGCCCCGG ACACGCTGCT GCAACGGGTG GTGGACGGCG TGCCTTTCCA GGGCACCACC AGCGTGAGCA CCTCGCAATC CTGGTCCGGC GATCATTCAA TGCCTTCGGC CGTACCGACC CAATGCCCGG AACCGGCACG CATCCACGAG AGCAACGACC ATCTCGACGC ACTTCAGGCC CTTTATCAAC GCGGCCTGGA CAAGGCCGAG AAGGCCGGCT ATGTCGTCGA TAGCGCCCTG GTCCACGGCG ACAGCGAGGC GCGCATCCGC GAGGCGCTGC AGCGCTATCA CCGGCGGCGG GAGCTTGCTT TCCAGGGTGC TCGGTCCCAC CTGGAACAGG GTCAGGGCCT GCCACCGACG CGGCCGCTCC ATAAGATCCT GGAGCAATAC CGCCGGCATG TGGCGCTCTG CGACCAGGAC CCGGTACCAG AGGCCGTCGA ACGCTGCGAG ATCGCCTCCG TGATCGAGCA CTACATCCCG CAACTGGAGC AGGACTACCG CCACTACATC GACAGTCTCA TCGAGCTGGC CGGGCTGGGC GTGGCCACCC CCGAGCTGGC GCTGGCTGAG GACCCGGACG CCGGCTTCGC CGACGGCGTG GACTACGTGG CCCGCTACTT CGCGACCCTG GACGAACTCG ACGCCTTGCG CGAGGACGTG GACACGCGGC TCAGGGAGTG GGAACAGGGC ACGGGCCGGG CCACCCCGCT GCCCATCTTC CTGTTCACCG ACGAGCAGGC GCGGTTCGAC CGGCTGCGCG AGCGCATGGA CCGCCTCTAC CGCACGGCCC GGCGCCGGGT GGACCGCACG CGCCCGAGGC GCGTCCTGCA CTGGGACCTG GGCCCCGACG ACATCCGCGA CCCCGAACCC TACCGCCCGC CCCCGATCCA CCGGCTGGTG CGCGCCGACT TCCCCCTGCG CGAGTTCAGC GGCCCCGGTC GGCAGCGCAC CCTCGATCAC CTGAGCCTGC ACCAGCTGGG CGAGACCCGC CCCCACTACG CCCGCCAGCG CGATGCCGCC ATCGCGCACG ACAGCCGCAC GGTCACCGAG CCCCGCTCCC TGCCCGACAC GGCACTGACC GGGTGGCTCA CCCGACGCGG CTGCCGGCGG CTCGACTGGA ACCCCGACTG GCACAGCGAG CCGCTCGGGC TGTTCGAGCC GGAACGCTTC TTCCACGACC TCGACCACCA GGGCCTGGTC ATCGACCGCC TTGCCGACGA CAGCGCCCGG GAGGAGTGGG GCCGGCGCCT GCGCCGGATC CTGTTCGCCG ACCCGCTCAA CCACCCCATG CGCCTGTTCG ATGCCAGTGG CCCGGCGCAG CTCCTGCGCC TGCTGGCCGG CGCATACGCC GAGCCCGACC GGCGCGACCG GGCGCTGTCC GGGGAGGCAC CCCTGTGGCT GCGCCGGCCC GGACCCGTGG CGCAGGCCGA ACCGGAGACA GCCGCTTCCG GAACCGGTGC CCGCATCGGC CTGACCGCCC GCTACCAGGA CACCGCCGGC ATCGACACCG GGGGCAACGC CGGCGGCCGG ACGGTCAGTG TGGCCCCCAG CCTCGCTCTC GACGAGCTGG GCATCACCGC CACCGTGGCC CCGCGGGCAG GACGGCTCGC GGGTCGTGTT GCGATTGCGC GGCCAGCACG CCTTCGACTT CGGCCGTGGC GAGATCGCCC TCGCGCCCAT CCAGCTGCCC GATCCCGCCA AGGCGGAGCC GGTCATCGTC CCCTTCGGCC TTGA
|
Protein sequence | MTDSIAPQVW PAGPRGRDDD LFHEDPRPLC ATFLWADIVY LTGSGEFWLL NATAAAAMHH AADKLADIAA VDDRDERNRR LSEEAGVLDS FLPAHPVSFL GEADRQRFAQ TLQQLAALQD EAPDTLLQRV VDGVPFQGTT SVSTSQSWSG DHSMPSAVPT QCPEPARIHE SNDHLDALQA LYQRGLDKAE KAGYVVDSAL VHGDSEARIR EALQRYHRRR ELAFQGARSH LEQGQGLPPT RPLHKILEQY RRHVALCDQD PVPEAVERCE IASVIEHYIP QLEQDYRHYI DSLIELAGLG VATPELALAE DPDAGFADGV DYVARYFATL DELDALREDV DTRLREWEQG TGRATPLPIF LFTDEQARFD RLRERMDRLY RTARRRVDRT RPRRVLHWDL GPDDIRDPEP YRPPPIHRLV RADFPLREFS GPGRQRTLDH LSLHQLGETR PHYARQRDAA IAHDSRTVTE PRSLPDTALT GWLTRRGCRR LDWNPDWHSE PLGLFEPERF FHDLDHQGLV IDRLADDSAR EEWGRRLRRI LFADPLNHPM RLFDASGPAQ LLRLLAGAYA EPDRRDRALS GEAPLWLRRP GPVAQAEPET AASGTGARIG LTARYQDTAG IDTGGNAGGR TVSVAPSLAL DELGITATVA PRAGRLAGRV AIARPARLRL RPWRDRPRAH PAARSRQGGA GHRPLRP
|
| |