Gene Mlg_1529 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1529 
Symbol 
ID4270534 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1742504 
End bp1744021 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content58% 
IMG OID638126287 
Productintegrase catalytic subunit 
Protein accessionYP_742368 
Protein GI114320685 
COG category[L] Replication, recombination and repair 
COG ID[COG2801] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.544255 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATTCGT ATGAAGACCG GATGAGAGCC GTGAAGCTCT ACATCCAGTA CCACAGGAGT 
GCCGCCGCCA CCGTCCGAGA GCTCGGCTAC CCCTCGAAGA AAAACCTGCG CCGTTGGCAT
GAAGCCTACA TGAGAACCGG GGACCTGCCG GAGCGCAGTG CTCCTAAGCC GAAGTACTCG
CCGGAACAGA AGCAAAAGGC TGTGCAGCAC TACCTGGGCC ATGGTTGCTG TCTCGCAAGG
ACCCGAAAGG CCCTGGGTTA TCCAAGCGTG GGCGTTCTAC GTGAGTGGGT CATGGAGCAG
AACCCCGGGC GGCGGCAGGT CTCAGCCGGC AGCTCAAAAG GTCCACCGCT CTCACCTGAG
GCGAAACGTG AGGCGGTCAT TGAACTGTGT TCCCGCCAAG ATCCCGCCAC GAAGGTTGCC
AAAAACCTGG GCGTTAGCAG GCAGGTTCTG TACAAGTGGA CAGACCAGTT GCTTGGTGAT
GAGGCCAATC CACGGATGAA ACGACGCGAT GATGAGGTGT TGCCGCTGGA GCAGGAAGTT
AAAGAGCTGC AAAGGCGCGT CCATCGACTC CAGCTCGAGC ACGATCTTCT GGCGAAGGCG
AACGACCTGA TAAAAAAGGA TCTTGGCGTC GACCTCCGGC TCCTGACGAA CAGGGAGAAG
ACCCTGCTGG TTGACGCCCT GAGACAGACT TATCGGCTCT CGGAAATCTT GTCGCAACTG
TGCCTGCCGC GCAGTTCCTA TTTCTACCAT CGGGCAAGGA TCCAGCTACC TGACAAGTAT
GCAACCGTTC GCGTGAGCGC GACCCAACTC TTCGAGACCA ACCACCGCTG CTACGGTTAC
CGGCGCATAC GGGTTGCTCT CAACCGACTT GGCATCGTGA TCTCTGAGAA GGTAGTCCGG
CGCCTTATGG CTGAGGAACA GCTCATCGTC CAAAGGCGCA AGTGTCGACG GTACAGGTCG
TACCGCGGAG AAATCACTCC GGCCCCCGAG AATCTGGTCA ATCGGGATTT CAGCGCCCCT
GCGCCCAACC GCAAGTGGCT GACGGATATC ACGGAGTTCC AAATCCCTGC AGGCAAGGTT
TACCTCTCTC CAGTGATCGA CTGCTTCGAT GGGTTGGCGG TGAGCTGGAC TATAAGCACA
AGCCCCGATG CCACCCTGGT AAACACGATG CTTGATGACG CCATAGCAAC GCTGGAGGAC
GGCGAGACAC CTATTATCCA CAGCGATAGA GGAGCTCATT ACCGCTGGCC CGGATGGCTT
GCCCGCATCC AAGATGTCGG CCTTATCCGG TCGATGTCGC GCAAAGGCTG TTCGCCGGAC
AATGCCGCCT GTGAGGGCTT CTTCGGGCGA CTCAAGACCG AGTTCTTCTA CCCACGCGAC
TGGCACGGGA TAACACTTGA GCAATTCTTC GAAGAGCTCG ACGAGTATAT CCGGTGGCAC
AACCGAAACA GGGTTAAGCT GTCCTTGGGA GGCCAGAGCC CGCTCGAGTA TCGAGAGCGC
TTAGGACTGG CCGCATGA
 
Protein sequence
MYSYEDRMRA VKLYIQYHRS AAATVRELGY PSKKNLRRWH EAYMRTGDLP ERSAPKPKYS 
PEQKQKAVQH YLGHGCCLAR TRKALGYPSV GVLREWVMEQ NPGRRQVSAG SSKGPPLSPE
AKREAVIELC SRQDPATKVA KNLGVSRQVL YKWTDQLLGD EANPRMKRRD DEVLPLEQEV
KELQRRVHRL QLEHDLLAKA NDLIKKDLGV DLRLLTNREK TLLVDALRQT YRLSEILSQL
CLPRSSYFYH RARIQLPDKY ATVRVSATQL FETNHRCYGY RRIRVALNRL GIVISEKVVR
RLMAEEQLIV QRRKCRRYRS YRGEITPAPE NLVNRDFSAP APNRKWLTDI TEFQIPAGKV
YLSPVIDCFD GLAVSWTIST SPDATLVNTM LDDAIATLED GETPIIHSDR GAHYRWPGWL
ARIQDVGLIR SMSRKGCSPD NAACEGFFGR LKTEFFYPRD WHGITLEQFF EELDEYIRWH
NRNRVKLSLG GQSPLEYRER LGLAA