Gene Mlg_1798 details


Gene Information

Locus tag: Mlg_1798
Symbol:
ID: 4268717
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2053054
End bp: 2054343
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 60%
IMG OID: 638126554
Product: putative transposase
Protein accession: YP_742632
Protein GI: 114320949
COG category:
COG ID:
TIGRFAM ID:
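
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon. Both function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(2053054, 2054343)
print(length, protein_length(length))  # prints "1290 429"
```

Under these assumptions the result agrees with the listed gene length of 1290 bp and protein length of 429 aa.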


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.247476
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAGGA AGCCGACCAT TCAGGACGTA TTTCTGGAGC ATTTTGATGG GCTTCGTAAA 
CATCATTGTT TTCCGCTCCC GCAGCTCAAA GCGGCAGAAT CGATTATGAA TTGCCGGACG
GAAGCGCTGG GGGGCCACGA GCAGTGTTGT CCCCATGGCC ACCACCGCCG CGTGCAGTAC
CACTCGTGCC GGCATCGTAG CTGCCCGCAA TGTGCGGAGC GTGCCAAGGC ACAGTGGATG
GAGCGTGAAC GAGGCCGCCT GCTGCCTTGC GATCATTTCC ACGCTATCTT TACCGTGCCC
CATGAATTGA TCCCCCTGTG GCGGTACAAC CGCCGGTCCT TCGTGGCCGC GCTATTCGAT
GCCAGTCGGG ACACCCTCAT CCAACTGCTT GCGCAAGATC GTCACCTAGG GGCGGTCCCC
GGCATTGTGA TGGCCCTGCA CACGTGGGGT CGAACCCTTA ACGCCCACCC CCATGTTCAC
TGCTTGATCA GTGGGGGTGG ATTGACGACG GGTGGACAGT GGCGAGCGGT GAAGGGTGAC
TTTCTGCTGC CTGCGGCCGT GGTGAAGCGC GTGTTCCGGG GCAAACTACT CTCCCGTCTG
CGCCATGATC TGGATACTGG CCGACTGAAA GCGCCCCGTG GCGATCATCC GACCGTGGAT
TACTGGCTCA AGGCCACTGC GAAGAAAGAA TGGAATGTCC GTTTGCAGGG CCGCTATACA
CATGGACGAG GAGTCGCTGC GTACCTGTCT CGCTACGTGC GCGGCGGACC CATCACAAAC
ACCCGGATCA AGTCGTTGGG GGAAGGCCAG GTTCAATTCC GTTACCATGA CCACCGGGAT
GGTCGACGGA AACTGCGGCG TGTGGCGGTG GAGCGATTCA TGGAACAGAT CCTGTGGCAT
GTCCCGGAGC CGGGTCAGCA CACCGTCCGC CACGTTGGTC TGTATGCCCA CACCTGCCGT
GCCAAGCGAA TGGCTTGCCG AGAACAAATG GGGGCGCCTA TGCCCGAAGT GGAGCCGAAG
AAGCAGACCT GGCAGGCGTA CCTGGAGATG CTTGGCCATA GCGACGCCCT GTGCTGTCCC
GCTTGCGGTG CCGAGCTGAT CCGGGGATTC ACGCTTCCCC CGAGTCGCCA CCGGATTCAA
AATTCCCTAA TAAGAAGTGG GTCGCCTCCG AAGGCTTCGG GCAGTGTGCA ACAAGGCGTT
GAACCGGACC CGCTAGTGCC TTCGAGGAGG TCCGAGGTGA ATGAGGGCGA TCGGCATTTT
TATTTGCCTC GATGCGGGCC GGTTAACTAG
 
Protein sequence
MARKPTIQDV FLEHFDGLRK HHCFPLPQLK AAESIMNCRT EALGGHEQCC PHGHHRRVQY 
HSCRHRSCPQ CAERAKAQWM ERERGRLLPC DHFHAIFTVP HELIPLWRYN RRSFVAALFD
ASRDTLIQLL AQDRHLGAVP GIVMALHTWG RTLNAHPHVH CLISGGGLTT GGQWRAVKGD
FLLPAAVVKR VFRGKLLSRL RHDLDTGRLK APRGDHPTVD YWLKATAKKE WNVRLQGRYT
HGRGVAAYLS RYVRGGPITN TRIKSLGEGQ VQFRYHDHRD GRRKLRRVAV ERFMEQILWH
VPEPGQHTVR HVGLYAHTCR AKRMACREQM GAPMPEVEPK KQTWQAYLEM LGHSDALCCP
ACGAELIRGF TLPPSRHRIQ NSLIRSGSPP KASGSVQQGV EPDPLVPSRR SEVNEGDRHF
YLPRCGPVN
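
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up step together with a GC-percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the demo string is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence; uppercase it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, used only as a short demo.
demo = clean_sequence("ATGGCAAGGA AGCCGACCAT")
print(len(demo), gc_percent(demo))
```

Applying the same two helpers to the full gene sequence gives a length and a GC percentage that can be compared with the Gene Length and GC content fields.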