Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1530 |
Symbol | |
ID | 4270535 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1744128 |
End bp | 1745699 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638126288 |
Product | integrase catalytic subunit |
Protein accession | YP_742369 |
Protein GI | 114320686 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4584] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCACAC GGGCTGGCGA GGACGGGGTA TTGAAGGAGT GGGTCGTGAT ACACAAGATC AAGGCACTGT ACGACGAAGG CCGTGGGCTC TCAGTTCGGG CCATCAGCCG GGAGCTGGGC ATCTCGCGCA ACACGGTGCG CAAGTACCTG CGGGCGGACA CCGAAGCGGT CGCAGCGGAG CGGGCCGATG GGCGCCGGGG CCGGCTGCTG GATGAGCACC GGGCTTACAT GGAGTATTTG CTGCGCCGCT ACCCGCAGCT CAGCGCCGTG AAGGTGGCGC GCAAGCTCCG GGACAAGGTC GGTGACCTGG CGGTCTCGGA CCGCAGCCTG CGCCGGTATC TGCAGGAGCT GCGCGCCAGC GTCCAAGTGG CCCAGCCGCG CTACTACGAG CCGGTGCTGG ACGTGGTGCC GGGCGTACAG TGCCAGGTGG ACCCCGGTGA GTTGCGGGGC GTGGCCATCA GCGGCGTGGA GCGCACGGTC TACTTCGTGG TCTTCGTGCT CTCGTTCTCG CGGCTGATGC ACGTTGCGGT GGCCTTCCGG CCCATCGACA CGGCGCTGTT CATCCGCATG CATGATGAGG CGCTGCGGGC CTTTGGCGGT ACCCCGGAGG AGTGCGTCTA CGACCAGACG AAGATGGTGG TCATCGCCGA GCAGTTCCGG GAGCTGACGG TCAACGAGCG CTTCCATGAG TACGCCACCG GTGCGGGCTT TCGCATCCAT GCCTGCCGGG GGTACGACCC GGAGAGCAAG GGCAAGGTGG AGGCCGGGGT GAAGTACGTT AAGCGCGATT GCCTGTACGG GGAGCGCTTT GCCGACGAGG CAGACGTCCG CGCCCACGTC CAGCAGTGGC TCGACCAAGT GGCCAATGTC CGCCGCCACG GCACCACCGG GCGTGAGCCC CGGGGGCACT TTGAGGCTGA AGAGCGGGCG CACCTACGGG CCTACCTCAC CCCCTCGTGC TTGACCCAGG CGGCTGCGGC GCGCCAGACC CGCAAGGTGG ACAAGACCGG GCTGATCGCC TGGCACTCGA ACAAGTATTC GGTACCCATG CGCTACCAGC GTGGCCGGGT GGGCGTGCAG GCCGACGAGA CCCAGCTTCA CATCCTCGAC CTGGAAAGCG GTGAGATCGT GGCCACCCAT ACACTGGCCA CGGGCAAGGG CCAGACGGTG CGTAACACCG ACCACTACCG GGATCGCCGA CAGCAGATCG AGACCCTGGA GGCCGCCATT GGCGAACGCG TGGGCGAGCA GACCGGAGCC CGGCTGTGTG CCCGGTTGCG GGCCAGCAAC CCGCGGATCT ATCGCGACCA GGTGGCCGCC GTACACGCCC TGCTGGAGAG CGGGCCGCCC CCGGCACCCG GACTGGTCGA GGACCTGGCC GGGCGCGAAG GGATGACCGC CACCCGCTTC AAGGCCCAAC TGCAGGCGGC ACACCGGGCC CAGGAGCGGG GCCGGGACCT CGAAGCGGAT GCCGACGAGC CCGCCGTGGA CGCGCAGGCA CTGGCCCTGT CGGCCTACGC CCATCTTGGC CAGTCGGCCG GCCAGGAGGA GCTGACCCAT GAGCCTGCTT GA
|
Protein sequence | MATRAGEDGV LKEWVVIHKI KALYDEGRGL SVRAISRELG ISRNTVRKYL RADTEAVAAE RADGRRGRLL DEHRAYMEYL LRRYPQLSAV KVARKLRDKV GDLAVSDRSL RRYLQELRAS VQVAQPRYYE PVLDVVPGVQ CQVDPGELRG VAISGVERTV YFVVFVLSFS RLMHVAVAFR PIDTALFIRM HDEALRAFGG TPEECVYDQT KMVVIAEQFR ELTVNERFHE YATGAGFRIH ACRGYDPESK GKVEAGVKYV KRDCLYGERF ADEADVRAHV QQWLDQVANV RRHGTTGREP RGHFEAEERA HLRAYLTPSC LTQAAAARQT RKVDKTGLIA WHSNKYSVPM RYQRGRVGVQ ADETQLHILD LESGEIVATH TLATGKGQTV RNTDHYRDRR QQIETLEAAI GERVGEQTGA RLCARLRASN PRIYRDQVAA VHALLESGPP PAPGLVEDLA GREGMTATRF KAQLQAAHRA QERGRDLEAD ADEPAVDAQA LALSAYAHLG QSAGQEELTH EPA
|
| |