Gene Mlg_0637 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0637 
Symbol 
ID4270826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp687513 
End bp688748 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content64% 
IMG OID638125385 
Productphage integrase family protein 
Protein accessionYP_741481 
Protein GI114319798 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.102856 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAAGA AAGCCAAAGA GTTGTCAGCG ATCCAAGTTA AGCGACTATC GAAGGTGGGG 
GTGCATGCTG TCGGCGGTGT ATCCGGTCTG CTGCTGCGCG TCTACTCCAG CGGTGCCCGA
TGCTGGGTTC TGCGCACCAG AGCCGGAGGG AAGCGACGGG ACTATGGTCT CGGTGGATAC
CCAACAGTGA CACTCGCTCA GGCGCGTGAA AGGGCGCGTG AGCTGCTGGA CGAGCTGTGG
CGCGGCAATG ACCCCGTTGC CGAGCGTCGG GGGCGTATAG CGGAGCAACG GGCGGCTGAG
GCCAAGCGCC TGACGTTCGC TCAGGCCGCC GCCCAATGCC ACCAGGCGAA GGCCGCCGAG
TTCCGCAACG CCAAGCATAA GCGCGACTGG ATAAGCAGTC TTGAGCGTCA TGCCTTCCCG
GTGCTGGGTG ATCTGCCGGT GGCCGACATT GAGTTGGCCC ACGTCCTCAA GGTGTTGGAA
CCGATATGGA AGGAAAAGAC CGAGACCGCC ACGCGGGTAC GCCAGCGGGT GGAGTCGGTT
ATCACCTGGG CGACTGTCTC CGGTTATCGG GAGGGCGAGA ACCCGGCCCG ATGGGCTGGC
AACCTGGAGG TTGCGCTGCC GGCTCCGAAC AAGATCAGGA AGGTGAAGCA CCACCGCGCT
CTACCGTGGA AGGAAGTGCC GGAGTTCATG GCCGCGTTGA GACAGCGCCA AGGCATGGCC
GCACTCGCCT TGGAGTTCGC CATCCTCACC GCTGCCCGCT CCGGTGAGGT GCGCGGTGCC
ACCTGGGACG AGATCGACTT AGACGCCCGC GTCTGGACGG TGCCAGCCGA CAGGATCAAG
GCCGGTAAGC CCCACCGCGT ACCGCTGTCC GATGATGCAG TGGCGGTGCT GGAGCGCGTG
CCACGGATGG AGGGCAGTAA CCTTGTGTTC ACCGCGCCGC GTGGTGGCCA GTTGTCCGAC
ATGACGTTGG GCGCTGTCCT CAAGCGGATG GAGGTAGACG CCACGGCCCA CGGGTTCCGC
TCCACGTTCA AGGATTGGGC GCGATCCTGC ACCAGCTATC AGGACGAGGT GTCAGAGCTG
GCCCTAGCTC ACGTCAACAG CGACGCCACC CGCGCCGCCT ATGCGCGTGA CGAGCTGTTG
CCACAGCGCA AGCGGCTGAT GCAGGAGTGG GCGCGTTACT GCCGAGACGG CATGCCGGAG
ACGGCAAGCG TTACTCCTAT CGGGCACAAC ATCTAG
 
Protein sequence
MPKKAKELSA IQVKRLSKVG VHAVGGVSGL LLRVYSSGAR CWVLRTRAGG KRRDYGLGGY 
PTVTLAQARE RARELLDELW RGNDPVAERR GRIAEQRAAE AKRLTFAQAA AQCHQAKAAE
FRNAKHKRDW ISSLERHAFP VLGDLPVADI ELAHVLKVLE PIWKEKTETA TRVRQRVESV
ITWATVSGYR EGENPARWAG NLEVALPAPN KIRKVKHHRA LPWKEVPEFM AALRQRQGMA
ALALEFAILT AARSGEVRGA TWDEIDLDAR VWTVPADRIK AGKPHRVPLS DDAVAVLERV
PRMEGSNLVF TAPRGGQLSD MTLGAVLKRM EVDATAHGFR STFKDWARSC TSYQDEVSEL
ALAHVNSDAT RAAYARDELL PQRKRLMQEW ARYCRDGMPE TASVTPIGHN I