Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1061 |
Symbol | |
ID | 4268982 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1239341 |
End bp | 1240735 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638125812 |
Product | phage integrase family protein |
Protein accession | YP_741903 |
Protein GI | 114320220 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.69421 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.669577 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCGCA AGGGTATTAC GGCCAAGAAA CTGGAGGCGC TCCAGGGCAA GCGCCGCAAG TCGGCCACCA GGGAGTGGGA CGGTGACGGT TCAGGGTTCG GGGTGAAGGT GTCGGCAGCC GGGCGGCTGA CATTCTTTCA GTTCTACTAC TCCCCCGAGG GCACCACCGA TAAAGACGGT AACGACATTA CCGGCAAGCG GCGGTTCATG GGGCTGGGTA ACTACCCTGA GACAACGCTC GCAGAGGCGC GGGAGAAGGC GCAGGAGGCG CGAGAGTTGC TAGAGCGGGG CATTGATCCG CAGGAGCACG CCCGAGAGCA GCAGGAGGCG CACAGGCGGG AGAAACGCAA GCGGGCGCAA CGGGGCACGC TGGCAGGGGT GGCGGCGCTG TACCTGTGGC ACATGCGCAA GCGCGGGCGC TCCAGGGAGT ACATTACCGC CGTTCGCCGT GGGTTCCACC GCGACGTGTT CCCGGTGGTG CCCCGTGACA CCAAGGCCGG GGACGTGGAG CCGGAGGACG TGCAACTGAT CCTGCACCGG CCCCTAAGCC GTGGCGCTGA CCATACCGCC CGAGTGCTGC GGGCTAACCT GCATCGCGCG TTCAAGTTGG CCATTCAGGC GGATAACGAC CCGCGCAACC TGGGCAGTGC CGTTAAGTTC CGCGTGCGCC ACAACCCGGT GGAGGACGTG CCGCTTGAGG TGCACGTTAC GCCCGGAGAT CGGGAGCTTT CATTTAGCGA GATCGGGCGC GTATGGCGGG AGGCCGACCA CGCGACCCCT TACCCGCAGG ATGCGTTGCT GTTGCGCCTG CTGCTGGCGC TGGGCGGCCA GCACATTACC GAGCTGAGGG AAGCACAGTG GCCCGAGTTC GATTTGCAGG CCGGACAGTG GCACCTGAAA GCCGCCCGGC ACAAAAACCG CACCCGCGAC CACCTGGTAC CAATCAACAG CACCGCCGCC GAAGTGCTGG AGGAATTGCG GGCGCTGACT GGTGGGCAAG GGTACCTGTT CCCGCAGCTA CGCAACGCGC ACAAGCCCAT GCGAGCCGAG AGGCCGGGCG CTATCGTTCG CGGTCTGCTG GCGCACCTGG AGGCGCAGGG CGAGCCGATG GAGAAGTTTA CGGCCTCAGA CTTCCGGCGC ACCTGCAAAA CGCGGATGCA CGAGATAGGG ATTCCCAAAA CCACCACCAA CCACCTGCAC AACCATGACT TTGGTGGCGT GAGCGCGAAG CACTACGACC GCTACGACTA TTGGGGCGAG AAACAGCGGG CCATGCGGGC ATGGGATATT GCGCTGAAAG CCGCCATTGC AGGCGAGCCG GTGCCCGAGG CGCGTTGCCG GGCGGCGCTC CAGTGGGACG AGAATGGCGG GGCGCGGCTG GAGGTGGTGG GCTAG
|
Protein sequence | MARKGITAKK LEALQGKRRK SATREWDGDG SGFGVKVSAA GRLTFFQFYY SPEGTTDKDG NDITGKRRFM GLGNYPETTL AEAREKAQEA RELLERGIDP QEHAREQQEA HRREKRKRAQ RGTLAGVAAL YLWHMRKRGR SREYITAVRR GFHRDVFPVV PRDTKAGDVE PEDVQLILHR PLSRGADHTA RVLRANLHRA FKLAIQADND PRNLGSAVKF RVRHNPVEDV PLEVHVTPGD RELSFSEIGR VWREADHATP YPQDALLLRL LLALGGQHIT ELREAQWPEF DLQAGQWHLK AARHKNRTRD HLVPINSTAA EVLEELRALT GGQGYLFPQL RNAHKPMRAE RPGAIVRGLL AHLEAQGEPM EKFTASDFRR TCKTRMHEIG IPKTTTNHLH NHDFGGVSAK HYDRYDYWGE KQRAMRAWDI ALKAAIAGEP VPEARCRAAL QWDENGGARL EVVG
|
| |