Gene Information |
Locus tag | Mlg_1732 |
Symbol | |
ID | 4268981 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 1981426 |
End bp | 1982544 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638126490 |
Product | hypothetical protein |
Protein accession | YP_742568 |
Protein GI | 114320885 |
COG category | [S] Function unknown |
COG ID | [COG2187] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
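The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Mlg_1732.
length_bp = gene_length(1981426, 1982544)
print(length_bp)                  # 1119
print(protein_length(length_bp))  # 372
```

Both results match the Gene Length and Protein Length rows of the record.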
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.105491 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.994205 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCACC GCACCGGCGG TGTGCGCCCG TCTGAACCCG GAGACAACAG CGCGCCCATG GCACAGGAAT GGATAGACAG CCCGTCCGAT GGTCCGGGTC CGGTCCCGTT GCGGGACAAG GTGGCCTTTC TCGCCGACAG CGATGCCTAC CCCGACGTCC AGGGCAGGGT AACGGTGCGC GAGACCCATA TGTCCTGGGT GTTCCTGGCC GGAGATCGGG TCTACAAGCT GAAGAAACCG TTGCGGTATC CCCGCCGTGA TGGGCGCTTT TTGTGGGGTC GGGAGTTTCT CTGCCGCGAG GAGGTCCGGC TCAACCGGCG GCTGGCGCCG GATGTCTACC TCGGTGTCTC GGCGCTGACG GTGACGGCCG ATGGGCGGCT GGCGTTGCAG GGCGGGGGGC GGACGGTGGA CTGGCTGGTG GTCATGCGGC GGCTGCCGGC GGAGCGGATG CTGGATAACG CCATCGCGGC CGGGGCGGTG GACCGGGAGC GGATCGACCG GCTTGGTGAG CGGCTGTCAG GGTTCTACCG AGGCCTGGCA CCGGAGCCGG TCTCGCCGCA GACGTATCTG CAGCGTTTCG CTGGCGAGCA TGCGAGGAAT CGGGCGGTGC TCACCGACCC GCGTTTCGCG CTGCCCAAAG GCGCCCTCCA GCGGGTATTG GACGCGCTGG ACCGGTGGCT GGAGGCGGGG GCGCCGGCCC TGCGTGAGAG GGCGCGGGCG GGGCGAATCG TGGAGGGGCA TGGCGATCTG CGCCCGGAGC ACGTCTGCCT GTGTGATACC CCGGTAGTGA TCGACGGCCT GGAGTTCAGC CTTGCCCTCC GTCAGGTAGA CCCTTTCGAC GAGCTGGCCT TCCTGGGTAT GGAGTGCGAC CGGCTGGGGG CGCCCTGGAT CGGCCCGCGA CTGATCCGCC AGTGCGGCGA GGCCCTGGAT GACCACCCCG GCGCCCCGCT GCTGGCCTTC TACACTGCGT ACCGGGCTTG CCTGCGGGCG CGCCTGGCGC TCGCCCACCT GCTGGAACCC GACCCGCGCA GCCCGGAGCG GTGGGTGCCG CTGGCCGCCG ACTACCTGTC CCTGGCTGAA AGGGCCGCGG CTAACCTGGG CCCTCCAGCA ACTCGGTGA
|
Protein sequence | MRHRTGGVRP SEPGDNSAPM AQEWIDSPSD GPGPVPLRDK VAFLADSDAY PDVQGRVTVR ETHMSWVFLA GDRVYKLKKP LRYPRRDGRF LWGREFLCRE EVRLNRRLAP DVYLGVSALT VTADGRLALQ GGGRTVDWLV VMRRLPAERM LDNAIAAGAV DRERIDRLGE RLSGFYRGLA PEPVSPQTYL QRFAGEHARN RAVLTDPRFA LPKGALQRVL DALDRWLEAG APALRERARA GRIVEGHGDL RPEHVCLCDT PVVIDGLEFS LALRQVDPFD ELAFLGMECD RLGAPWIGPR LIRQCGEALD DHPGAPLLAF YTAYRACLRA RLALAHLLEP DPRSPERWVP LAADYLSLAE RAAANLGPPA TR
|
| |
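The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines. This is an illustration under stated assumptions, not the method used to produce the record: the helper names are hypothetical, the gene sequence is assumed to be shown in coding orientation (it begins with ATG and ends with TGA even though the gene lies on the minus strand), and the codon table is the standard one, whose amino acid assignments translation table 11 shares (table 11 differs in its set of permitted start codons).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the spaces that separate the 10-letter display groups."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence shown in the record.
prefix = "ATGCGGCACC GCACCGGCGG TGTGCGCCCG"
print(translate(prefix))  # MRHRTGGVRP
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the Protein sequence and GC content rows.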