Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1199 |
Symbol | |
ID | 4270687 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 1398723 |
End bp | 1400129 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638125948 |
Product | hypothetical protein |
Protein accession | YP_742038 |
Protein GI | 114320355 |
COG category | [V] Defense mechanisms |
COG ID | [COG0577] ABC-type antimicrobial peptide transport system, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0215661 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.10586 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCCC TGCGCCTCAC TCTCAACTGG TTGCGCTTCC GTCCCCTCCC CGCGCTGCTC AATCTGCTGT TGATGGCGCT GGGCACCGGC ACCATTGCCC TGCTACTGCT ATACGGCCAT CAACTGGAGC GCCAGTTCAC CCGCGATGCC CAGGGTATCG ACCTGGTGGT GGGCGCCAGT GGCAGCCCCA TGCAGCTGAT CCTGTCCAGC GTTTATCACC TGGACATCCC CACCGGCAAC ATCCCTGAAC GAACCGCCCG CGAGCTGGCG GACCACCGGC TGGTGAGCGA GGTCATCCCC CTGGCCCTGG GTGACAACTA CCGCGGCCAC CGCATCGTGG GCACGGACGC GGGCTACGTG GATCTCTACC GGGGGGAACT CGCCGAGGGG CGCCTTTGGG AGCAGGCCAT GGAGGCCACC CTCGGATCCG CGGTGGCAGC CCGCCATGGG CTCGCCATCG GCGATGAGAT CGTCGGCGCC CACGGCCTGG GTGGCGGCCA TGGCCATGTC CACGACTACG CCCCCTACAC CGTCGTCGGC ATACTGGCCC CCACCGGCAC GGTCATGGAC CGGCTGGTCC TGACCTCGGT GCAGAGCGTC TGGGACGTCC ATGACGACGA CCACGATCAC GACCACGATC ACGACCACGA TCACGACCAC GATCACGACC ACGATCACGA CCATGAGCAC GACCATGAGC ACGACCACAG CGACCACCCT GAGGCCGGCC ATGGCCATGA CCATGAGGCC GACCGGGCTC ACACCCACGA GCCGGCCCCG GATCACGACC ACGGGCATCC GGCCGGGCAA GGGCATGACC ACCGGGAGCC GGGCGCCGGC CACGACGATG TGGCCACCGC AGACCAGGAA CTGACCGCCC TGCTGGTCCG CTACCGCTCA CCGCTGGCTG CCATGCAACT GCCCCGCGCC ATCAACGCCG AGGCCGGCCT GCAGGCCGCC TCTCCGGCCT ACGAGAGCGC GCGCCTCATG AGTATGATGG GTGTCGGCCT GGACACCCTG AAGGCCTTCG GCGGGGTGCT GCTGCTGGCC GCCGGCCTGG GCGTCTTCAT TGGCCTCTAC AACGCCCTGC GGGAACGGCG GCACGACATC GCGATCATTC GCAGTCTCGG CGCCTCGCCC CGACTGGTGA GTGGCCTGGT GCTCCTGGAA GGCCAGCTTC TGGCCCTCAC AGGCACCCTA TTGGGCCTTG CCGGCGGCCA CCTGTCGGCC GAGTTGATCG GCCGCTGGAT CGGCCGCGAC CGGCCGCTGG AGCTGACCGG CCTGACCTGG GTGCCGAGCG AGGGCTGGCT ACTGCTGATC GCCGCCGGCA TCGGCCTGGT CGCCGCCCTG CTGCCGGCCT GGCAGGCCTA TCGAACCGAC ATCGCCCTGA CGTTGTCTGA GCGGTAG
|
Protein sequence | MNALRLTLNW LRFRPLPALL NLLLMALGTG TIALLLLYGH QLERQFTRDA QGIDLVVGAS GSPMQLILSS VYHLDIPTGN IPERTARELA DHRLVSEVIP LALGDNYRGH RIVGTDAGYV DLYRGELAEG RLWEQAMEAT LGSAVAARHG LAIGDEIVGA HGLGGGHGHV HDYAPYTVVG ILAPTGTVMD RLVLTSVQSV WDVHDDDHDH DHDHDHDHDH DHDHDHDHEH DHEHDHSDHP EAGHGHDHEA DRAHTHEPAP DHDHGHPAGQ GHDHREPGAG HDDVATADQE LTALLVRYRS PLAAMQLPRA INAEAGLQAA SPAYESARLM SMMGVGLDTL KAFGGVLLLA AGLGVFIGLY NALRERRHDI AIIRSLGASP RLVSGLVLLE GQLLALTGTL LGLAGGHLSA ELIGRWIGRD RPLELTGLTW VPSEGWLLLI AAGIGLVAAL LPAWQAYRTD IALTLSER
|
| |