Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2520 |
Symbol | |
ID | 4270159 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 2863838 |
End bp | 2864857 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 638127279 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_743350 |
Protein GI | 114321667 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.710279 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGTAC TTGGTGTGGA AAGCTCCTGC GACGAGACCG GCCTGGCCAT CTACGACAGC GCCCAGGGCC TGATGGCGCA CGCCCTGCAC AGTCAGGTGG CCACCCACGC GGAATACGGC GGCGTGGTGC CGGAGCTGGC CTCCCGCGAT CATGTCCGGC GGGTGGTGCC ACTGACCCGG CGGGTACTGG CCGAGGCCGG GTGCCGGCTG CGGGATATCG ATGCGGTGGC CTACACCCGC GGCCCCGGCC TGGTGGGCGC GCTGATGGTG GGCGCCGGCA TGGCGCGCAG CCTGGCCTGG GGGCTGGGGG TCCCAGCCCT GGGCGTACAC CACATGGAGG CCCACCTGCT CGCGCCCATG CTGGAGCCAA ACCCGCCGGC CTTCCCCTTC GTGGCCCTGC TGGTCTCCGG CGGCCACACG CTATTGGTCC AGGTGGCAGG CGTGGGCCGC TACCGCGTGC TGGGCGAGAC CCTGGATGAC GCAGCGGGCG AGGCCTTCGA CAAGACCGCC AAGCTGCTCG GCCTGCCCTA CCCTGGCGGT CCGGAGCTGG AGAAACTCGC GGAGTCGGGT GACCCGGGGC GCTACCGCTT CCCCCGGCCG ATGACCGACC GCCCCGGGCT GGATTTCAGC TTCAGTGGGC TCAAGACCCG GGTGCTGCAG ACCGTGCAGC AGAGCCGGGA GGCGGACCGG GCGGACATCG CCGCGGCCTT CCAGTCGGCG GTGGTGGATA CCCTGGTTAT CAAGTGCCGG CGGGCGCTGC GGGCGACCGG CAGCCAGCGG CTGGTGATCT CCGGCGGTGT GGGGGCCAAT GGTCTGTTGC GTGAGCAGAT GCGCGCCATG GCGGATCAGG CGGGGGCCAG CCTGCATTAC CCGCGGTTGG CGCTGTGTAC CGACAACGGC GCCATGGTGG CCTACACCGG CTGGTGCCGC CTGAGCGAGG GCCAGCACGA CGATCTGGAC TTCAGTGTCA CCGCCCGCTG GCCGCTGGCC GATCTGACCC CGCCCGGGCA GCCGGTCTGA
|
Protein sequence | MRVLGVESSC DETGLAIYDS AQGLMAHALH SQVATHAEYG GVVPELASRD HVRRVVPLTR RVLAEAGCRL RDIDAVAYTR GPGLVGALMV GAGMARSLAW GLGVPALGVH HMEAHLLAPM LEPNPPAFPF VALLVSGGHT LLVQVAGVGR YRVLGETLDD AAGEAFDKTA KLLGLPYPGG PELEKLAESG DPGRYRFPRP MTDRPGLDFS FSGLKTRVLQ TVQQSREADR ADIAAAFQSA VVDTLVIKCR RALRATGSQR LVISGGVGAN GLLREQMRAM ADQAGASLHY PRLALCTDNG AMVAYTGWCR LSEGQHDDLD FSVTARWPLA DLTPPGQPV
|
| |