Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0116 |
Symbol | |
ID | 4268203 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 129056 |
End bp | 130321 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638124842 |
Product | transposase, IS204/IS1001/IS1096/IS1165 family protein |
Protein accession | YP_740963 |
Protein GI | 114319280 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3464] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCCAA ACGCGTCTGA GTTCAGTCTG TTTTCCGCCG CCCTGGGTCT GGCGCCGCCT TGGGAGGTGG TTTCGGTGGA CTTTGACCCG GACGCCAAAC AGATCGACCT GGAGGTGGGC TTTGCCCGCG GGGCGCGCTT TGCCTGCCCG GCCTGCGGGG CGGCGGATCA GCCGGTCCAC GACAGCCGCC GGCGTAGCTG GCAGCACCTG CACTTTTTCG AGCACCGCGC CTATATCCAT GCCCCAGTGC CACGGGTGCG CTGCGGTAGC TGCGGGAAGA CAACGCAAGT GGCGGTGCCT TGGGCCCGCC CGGGCAGCGG CTTCACCGAA CTCTTTGAGG CGATGGTGGT GACACTGTGC GCGCAGATGC CGGTCCAGGT GGTGGCCCGC CATCTCGGCG TGGGTGATGA CGCGCTCTGG CGCATCCTGC ATCACTACGT CGATGCCGCC CGGGCGCAGG AGGACTTCAG CGAGGTCACG ACGGTGGGGG TCGACGAGAC CGCCGCCCGG CGTGGTCACC ACTACATCAG CATCTTCCAC GACCTTGAGC GCCGGCGCGT GCTCTACGCC TGCCCGGGGC GCGATCAGCA TACCGTGGCG CGCTTTGCCG AGGACCTGCG GGCCCACGGC GGTGAGCCCG AGGCCGTTAC CGCCGCCTGT ATCGATATGT CGAAAGCCTA CATCGCCGGG GTGAGCCGCT ACCTGCCGGC GGCTGCGGTC ACGTTCGATG GCTTTCACGT CATCCAGCTC GCCAACGCCG CCGTTGACCA GGTACGCCGC GCCGAGGTCC GGGAGCGGCC GATCCTCAAA CACACCCGCT GGGTGTGGCT GAAAGACGGC TGGCGCTGGA CCGCCCGCCA ACTCCAGGAC TTCCACACGC TCTCGCGCAC GCAACTCAAG ACCGCCCGCG CCTGGCGGCT CAAGGATGCC CTGCGCGAGC TCTACGCCCA GGCGCCGCCA CGCGAGATCG CCGAGGCAGA ACTCCAGCGC TGGTATAGCT GGGCACGGCG CTGCCGGCTC GCGCCGTTCA AACGCCTGGC GCTCACGATC AAGGCCCACT GGGATGGCAT CCTCAACGCC TTCGACTCGC GGCTGAACAA CGGCGGCGTC GAGGCCCTCA ACGGACGGAT TCAGGCCGCC AAGGCTCGCA CCCGCGGCTA CCGCACCGTG CGCAACCTGA TCACCATGAC CTACCTGATC GGCGGCAAAC TCACCCAACT GCCAAGCTCC CCCTACACCA CAACATCCAG GGTGGCGGCG GCATGA
|
Protein sequence | MAPNASEFSL FSAALGLAPP WEVVSVDFDP DAKQIDLEVG FARGARFACP ACGAADQPVH DSRRRSWQHL HFFEHRAYIH APVPRVRCGS CGKTTQVAVP WARPGSGFTE LFEAMVVTLC AQMPVQVVAR HLGVGDDALW RILHHYVDAA RAQEDFSEVT TVGVDETAAR RGHHYISIFH DLERRRVLYA CPGRDQHTVA RFAEDLRAHG GEPEAVTAAC IDMSKAYIAG VSRYLPAAAV TFDGFHVIQL ANAAVDQVRR AEVRERPILK HTRWVWLKDG WRWTARQLQD FHTLSRTQLK TARAWRLKDA LRELYAQAPP REIAEAELQR WYSWARRCRL APFKRLALTI KAHWDGILNA FDSRLNNGGV EALNGRIQAA KARTRGYRTV RNLITMTYLI GGKLTQLPSS PYTTTSRVAA A
|
| |