Gene Information |
Locus tag | Mlg_2780 |
Symbol | |
ID | 4269714 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 3161183 |
End bp | 3162343 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638127542 |
Product | type I phosphodiesterase/nucleotide pyrophosphatase |
Protein accession | YP_743610 |
Protein GI | 114321927 |
COG category | [R] General function prediction only |
COG ID | [COG1524] Uncharacterized proteins of the AP superfamily |
TIGRFAM ID | |
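The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch: the function names are illustrative and not part of any database API, and it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3161183, 3162343)
print(length)                     # 1161
print(protein_length_aa(length))  # 386
```

Both results agree with the Gene Length and Protein Length fields in the record.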
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCTTGC CCGATTATCG CGGCGGCAGC ATCGTCAATC TGATGGCCAG CATCAGCCGG GGCCTGGGGG CGCCGCGGAT TGGCATCCCG GAGGCGCGAC TGCTGCCGGC GCATCAGATC GGCCGGGCCC GCCACGTGGT TTTGCTGATG CTGGACGGGC TGGGCTATGA GTATCTGGCC GGGCATCGCG ATGCCTTGCT GCGCCAACAC CTGCTGGGCG AACTCACCTC GGTCTTCCCC TCCGCCACCG CCATCGCCGT GACCAGCTTT GCCACCGGCT TGACCCCGCG CCAGCACGGG GTCACCGGCT GGCACATGTA CCTGGAGGAG CTGGGCCGGG TCTGCACCAT ACTGCCCTTC CGCGACCGGG TCACCCGTGA GTCCATCGCC GAGGTGGATC CGGACGCCGT GCGGGTGCTG GAGCAGCCGC CGCTGGCCAA TCTGCTCGAC GCTGAGACCC ACCTGGTGAT GCCGGCGGAG ATCGCCGACT CCACCTACAA CCTGGCCACC GGGGGGCTGG CCTGGCGGCA CGGGGTGGCG GACCTGGCGG ACTATGTCGA CACGGTGGCG GGGCTGGTGC AGTCGGCGGG CGGGCGCCAG TACATCTATG CCTATTGGCC CCGCCTGGAC AGCCTGGCCC ACCAATACGG GATGGCCAGC ACCGAGGTCC AGGCCCACTT CGCCGCCCTG GACGCCGCCT TCGACGAGCT GCGCCGGGAG CTGGCCGGCA CCGACACCCT GTTGTTGGTC ACCGCCGACC ACGGGCTTAT CGACATCACC CCGGACGGGG TGCTGGAGGT GGCCGACCAT CCCGCGCTGG AGGAGACCCT GGCGCTGCCC ATCTGCGGTG AGCCCCGGGC CGCCTATTGC TATGTCCGCC CGGGGCGGGA GGAGGACTTC CTCAACTACG TGCAGGGGCC GCTTGCGGGC TGGTGTGATG TCCACACCCC TGGGGAATTG CTGCAGGCGG GGTGGCTGGG GCCTGGCCCG GCCCACCCCC GGCTGTCGGG GCGGTTGGGC GACTATGTGC TGGTGATGCG TGACAACCGG GTGATCCACC AGCGGCTCAG CGGCGATGAG CCATTCTCCC AGATCGGGGT GCATGGCGGC ACCAGCGGCG CGGAGATGCG AGTGCCGCTG ATGGCCGCAC ACTGCGTTTG A
|
Protein sequence | MILPDYRGGS IVNLMASISR GLGAPRIGIP EARLLPAHQI GRARHVVLLM LDGLGYEYLA GHRDALLRQH LLGELTSVFP SATAIAVTSF ATGLTPRQHG VTGWHMYLEE LGRVCTILPF RDRVTRESIA EVDPDAVRVL EQPPLANLLD AETHLVMPAE IADSTYNLAT GGLAWRHGVA DLADYVDTVA GLVQSAGGRQ YIYAYWPRLD SLAHQYGMAS TEVQAHFAAL DAAFDELRRE LAGTDTLLLV TADHGLIDIT PDGVLEVADH PALEETLALP ICGEPRAAYC YVRPGREEDF LNYVQGPLAG WCDVHTPGEL LQAGWLGPGP AHPRLSGRLG DYVLVMRDNR VIHQRLSGDE PFSQIGVHGG TSGAEMRVPL MAAHCV
|
| |
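The sequences above are displayed in space-separated blocks of 10 characters, so the spacing has to be stripped before any analysis. The sketch below shows one way to do that, compute GC content, and translate a coding sequence. It rests on assumptions: the helper names are hypothetical, the codon table is built from the standard amino acid assignments (which translation table 11 shares with the standard code), and alternative start codons are not given special treatment, which does not matter for a gene that begins with ATG.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 uses the
# same assignments as the standard code; the two differ only in start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
fragment = "ATGATCTTGC CCGATTATCG CGGCGGCAGC"
print(translate(fragment))  # MILPDYRGGS
```

The translated fragment matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and protein sequence fields to be checked the same way.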