Gene Mlg_2520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2520 
Symbol 
ID4270159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2863838 
End bp2864857 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content72% 
IMG OID638127279 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_743350 
Protein GI114321667 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.710279 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGTAC TTGGTGTGGA AAGCTCCTGC GACGAGACCG GCCTGGCCAT CTACGACAGC 
GCCCAGGGCC TGATGGCGCA CGCCCTGCAC AGTCAGGTGG CCACCCACGC GGAATACGGC
GGCGTGGTGC CGGAGCTGGC CTCCCGCGAT CATGTCCGGC GGGTGGTGCC ACTGACCCGG
CGGGTACTGG CCGAGGCCGG GTGCCGGCTG CGGGATATCG ATGCGGTGGC CTACACCCGC
GGCCCCGGCC TGGTGGGCGC GCTGATGGTG GGCGCCGGCA TGGCGCGCAG CCTGGCCTGG
GGGCTGGGGG TCCCAGCCCT GGGCGTACAC CACATGGAGG CCCACCTGCT CGCGCCCATG
CTGGAGCCAA ACCCGCCGGC CTTCCCCTTC GTGGCCCTGC TGGTCTCCGG CGGCCACACG
CTATTGGTCC AGGTGGCAGG CGTGGGCCGC TACCGCGTGC TGGGCGAGAC CCTGGATGAC
GCAGCGGGCG AGGCCTTCGA CAAGACCGCC AAGCTGCTCG GCCTGCCCTA CCCTGGCGGT
CCGGAGCTGG AGAAACTCGC GGAGTCGGGT GACCCGGGGC GCTACCGCTT CCCCCGGCCG
ATGACCGACC GCCCCGGGCT GGATTTCAGC TTCAGTGGGC TCAAGACCCG GGTGCTGCAG
ACCGTGCAGC AGAGCCGGGA GGCGGACCGG GCGGACATCG CCGCGGCCTT CCAGTCGGCG
GTGGTGGATA CCCTGGTTAT CAAGTGCCGG CGGGCGCTGC GGGCGACCGG CAGCCAGCGG
CTGGTGATCT CCGGCGGTGT GGGGGCCAAT GGTCTGTTGC GTGAGCAGAT GCGCGCCATG
GCGGATCAGG CGGGGGCCAG CCTGCATTAC CCGCGGTTGG CGCTGTGTAC CGACAACGGC
GCCATGGTGG CCTACACCGG CTGGTGCCGC CTGAGCGAGG GCCAGCACGA CGATCTGGAC
TTCAGTGTCA CCGCCCGCTG GCCGCTGGCC GATCTGACCC CGCCCGGGCA GCCGGTCTGA
 
Protein sequence
MRVLGVESSC DETGLAIYDS AQGLMAHALH SQVATHAEYG GVVPELASRD HVRRVVPLTR 
RVLAEAGCRL RDIDAVAYTR GPGLVGALMV GAGMARSLAW GLGVPALGVH HMEAHLLAPM
LEPNPPAFPF VALLVSGGHT LLVQVAGVGR YRVLGETLDD AAGEAFDKTA KLLGLPYPGG
PELEKLAESG DPGRYRFPRP MTDRPGLDFS FSGLKTRVLQ TVQQSREADR ADIAAAFQSA
VVDTLVIKCR RALRATGSQR LVISGGVGAN GLLREQMRAM ADQAGASLHY PRLALCTDNG
AMVAYTGWCR LSEGQHDDLD FSVTARWPLA DLTPPGQPV