Gene Mlg_0116 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0116 
Symbol 
ID4268203 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp129056 
End bp130321 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content68% 
IMG OID638124842 
Producttransposase, IS204/IS1001/IS1096/IS1165 family protein 
Protein accessionYP_740963 
Protein GI114319280 
COG category[L] Replication, recombination and repair 
COG ID[COG3464] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCCAA ACGCGTCTGA GTTCAGTCTG TTTTCCGCCG CCCTGGGTCT GGCGCCGCCT 
TGGGAGGTGG TTTCGGTGGA CTTTGACCCG GACGCCAAAC AGATCGACCT GGAGGTGGGC
TTTGCCCGCG GGGCGCGCTT TGCCTGCCCG GCCTGCGGGG CGGCGGATCA GCCGGTCCAC
GACAGCCGCC GGCGTAGCTG GCAGCACCTG CACTTTTTCG AGCACCGCGC CTATATCCAT
GCCCCAGTGC CACGGGTGCG CTGCGGTAGC TGCGGGAAGA CAACGCAAGT GGCGGTGCCT
TGGGCCCGCC CGGGCAGCGG CTTCACCGAA CTCTTTGAGG CGATGGTGGT GACACTGTGC
GCGCAGATGC CGGTCCAGGT GGTGGCCCGC CATCTCGGCG TGGGTGATGA CGCGCTCTGG
CGCATCCTGC ATCACTACGT CGATGCCGCC CGGGCGCAGG AGGACTTCAG CGAGGTCACG
ACGGTGGGGG TCGACGAGAC CGCCGCCCGG CGTGGTCACC ACTACATCAG CATCTTCCAC
GACCTTGAGC GCCGGCGCGT GCTCTACGCC TGCCCGGGGC GCGATCAGCA TACCGTGGCG
CGCTTTGCCG AGGACCTGCG GGCCCACGGC GGTGAGCCCG AGGCCGTTAC CGCCGCCTGT
ATCGATATGT CGAAAGCCTA CATCGCCGGG GTGAGCCGCT ACCTGCCGGC GGCTGCGGTC
ACGTTCGATG GCTTTCACGT CATCCAGCTC GCCAACGCCG CCGTTGACCA GGTACGCCGC
GCCGAGGTCC GGGAGCGGCC GATCCTCAAA CACACCCGCT GGGTGTGGCT GAAAGACGGC
TGGCGCTGGA CCGCCCGCCA ACTCCAGGAC TTCCACACGC TCTCGCGCAC GCAACTCAAG
ACCGCCCGCG CCTGGCGGCT CAAGGATGCC CTGCGCGAGC TCTACGCCCA GGCGCCGCCA
CGCGAGATCG CCGAGGCAGA ACTCCAGCGC TGGTATAGCT GGGCACGGCG CTGCCGGCTC
GCGCCGTTCA AACGCCTGGC GCTCACGATC AAGGCCCACT GGGATGGCAT CCTCAACGCC
TTCGACTCGC GGCTGAACAA CGGCGGCGTC GAGGCCCTCA ACGGACGGAT TCAGGCCGCC
AAGGCTCGCA CCCGCGGCTA CCGCACCGTG CGCAACCTGA TCACCATGAC CTACCTGATC
GGCGGCAAAC TCACCCAACT GCCAAGCTCC CCCTACACCA CAACATCCAG GGTGGCGGCG
GCATGA
 
Protein sequence
MAPNASEFSL FSAALGLAPP WEVVSVDFDP DAKQIDLEVG FARGARFACP ACGAADQPVH 
DSRRRSWQHL HFFEHRAYIH APVPRVRCGS CGKTTQVAVP WARPGSGFTE LFEAMVVTLC
AQMPVQVVAR HLGVGDDALW RILHHYVDAA RAQEDFSEVT TVGVDETAAR RGHHYISIFH
DLERRRVLYA CPGRDQHTVA RFAEDLRAHG GEPEAVTAAC IDMSKAYIAG VSRYLPAAAV
TFDGFHVIQL ANAAVDQVRR AEVRERPILK HTRWVWLKDG WRWTARQLQD FHTLSRTQLK
TARAWRLKDA LRELYAQAPP REIAEAELQR WYSWARRCRL APFKRLALTI KAHWDGILNA
FDSRLNNGGV EALNGRIQAA KARTRGYRTV RNLITMTYLI GGKLTQLPSS PYTTTSRVAA
A