Gene Mlg_1799 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1799 
Symbol 
ID4268718 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2054591 
End bp2055979 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content59% 
IMG OID638126555 
Productintegron integrase 
Protein accessionYP_742633 
Protein GI114320950 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID[TIGR02249] integron integrase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.174616 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTGATC CAAGCCGCAA AGATGCCCGT GCCCGGTTCT GGGATCGTTA CATCCAGGCT 
CTCCGAAAGC AAGGCGTGAA GCCACCCTTC GACCGATGGC ACGTACGTCG GGCGGAGGAG
TTCATCAAGG CGCACCCTGA TCAAAGGCTT CGCGAGTTGG GCCCTGATGG CGTGCAAAGC
TACCTGCGAC GGAAGGGACA CGCCGGTAAT CTCAAGGCCT GGCAGTTCCA GCAAGTGGTT
CGTGCTATAC AGACGCTGTA TAGCACGGCT AACATAGCAT GGGCTGCGAG GTTCGACTGG
CAGTATTGGA TGGATGCCGC CCGAACGCTC GAGCCTGAAC ACGTTTCTAC GGCCCGGGAC
ACCCCAGCGC CAACTCCGGT ACGGCTTGCC GAACGTGTTG CTGACACTCG TCTGGCACAC
GTGGTACACG CGCACCCCGC GCTTTTTGAG GCTTTTTCTG AAACAGTCAC CAGCAGAGGC
TTGGCGATCA GCACCGAGAA AACCTACCTG GCGTGGACCT GCCGGTTCAT TGTGTTTCTG
CAGTCGAAGC CTGTGGAAAG CGCGGACGAG GCGGATGTTC AACAATTTCT CAGTCAGTTG
GTCACACAGC GCAATGTAGC CGCCAGCACC CAGAGCCAGG CTCTGAACGC TTTGGTCTTC
TTCTTCCGGT ACGTTCTGCA GCGACCGCTT GGCGAACTCC AGGCTATTGA GAGGTCAAAA
CGGCCCAGGC GCCTTCCTGT GGTTCTCTCT CGCAATGAGG TCAAGGCGCT GTTGAGTGAA
ATGCACGGTG TTCACCATTT GCTGGCCTCA TTGCTTTATG GTTCCGGCAT GCGGGTCATG
GAAGCACTAC GATTGCGCGT CAAAGATATA GATTTCGAGC GGCGGGCAAT CACGGTGAGA
TCCGGCAAAG GCGATAAGGA TCGCGTGGTA CCCCTTCCCG AATCCACGAT AGCGCCGCTC
CAGAACCACT TGCGGCAGGT GAAGGCCATT CACGAGGGAG ACCTGGCCGA TGGGTACGGT
GAAGCCTCCC TCCCCCATGC CCTGGCCCGG AAGTACCGCA ATGCCGGGAA GGAATGGCAT
TGGCAGTATG TGTTTCCCAG TGCCCGGGTA GCGGTCGATC CCCGATCAGG GGCGATCCGT
CGACATCATC TTCATGAAAC CGCTGTCCAG CGTGCGGTAA AACGGGCGGT GAGAGCCTCC
AGAATACCGA AACAGGCCAG TTGCCACACC TTGCGGCACA GTTTTGCCAC GCACCTGCTG
GAGCGCGGCC AGGATATCCG AACCGTGCAA GAGCTCCTGG GCCACGCGGA TGTATCGACG
ACCATGATCT ACACGCATGT AATGAGCAAA GGTGGCGTCC CTGCGGCGAG CCCGTTGGAC
GGGCTTTAG
 
Protein sequence
MVDPSRKDAR ARFWDRYIQA LRKQGVKPPF DRWHVRRAEE FIKAHPDQRL RELGPDGVQS 
YLRRKGHAGN LKAWQFQQVV RAIQTLYSTA NIAWAARFDW QYWMDAARTL EPEHVSTARD
TPAPTPVRLA ERVADTRLAH VVHAHPALFE AFSETVTSRG LAISTEKTYL AWTCRFIVFL
QSKPVESADE ADVQQFLSQL VTQRNVAAST QSQALNALVF FFRYVLQRPL GELQAIERSK
RPRRLPVVLS RNEVKALLSE MHGVHHLLAS LLYGSGMRVM EALRLRVKDI DFERRAITVR
SGKGDKDRVV PLPESTIAPL QNHLRQVKAI HEGDLADGYG EASLPHALAR KYRNAGKEWH
WQYVFPSARV AVDPRSGAIR RHHLHETAVQ RAVKRAVRAS RIPKQASCHT LRHSFATHLL
ERGQDIRTVQ ELLGHADVST TMIYTHVMSK GGVPAASPLD GL