Gene Mlg_1804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1804 
Symbol 
ID4269466 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2062765 
End bp2064264 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content66% 
IMG OID638126560 
Productlysyl-tRNA synthetase 
Protein accessionYP_742638 
Protein GI114320955 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1190] Lysyl-tRNA synthetase (class II) 
TIGRFAM ID[TIGR00499] lysyl-tRNA synthetase, eukaryotic and non-spirochete bacterial 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATGA CCGAGCAGGA CGACAACAAG CTCATCGCCC AGCGGCGGGA AAAGCTCGCC 
GCGTTGCGCG AGGCCGGCCA GGCCTTCCCC AACGACTTCC GCCGCGACAG CCTGGCAGCG
GACATCCACG CCCGCTGCGC GGAGCTGGAC GACGAGGCGC TGGAGGCGGA GAACATCCGC
GTGCGGGTGG CCGGGCGGAT GATGGCCAAG CGGGTGATGG GCAAGGCGAG CTTTACCCAC
CTGCAGGATC AGTCCGGTCG CATCCAGCTC TTTCTGGCCC GAGATGAGCT GCCGGAGGGG
GTCTACCAGC AGTTCAAGGG CTGGGATGTC GGTGACATCA TCGGCGCGGC GGGGACGTTG
TTCCGCACCC GCAAGGGCGA GCTGTCGGTT AAGGTGGACG AGCTGCGCCT GCTCACCAAG
TCGCTGCGCC CGCTGCCCGA GAAGTACCAC GGGCTGACGG ACACCGAGGC GCGCTACCGC
CAGCGCTACG TCGACCTGAT CATGAACGAC GACTCACGCC GGGTCTTCAT GCTGCGCAGC
CGGCTGGTGG CGGGCATTCG TGACTTTTTG AACGGCCGTG GTTTTCTCGA GGTGGAGACG
CCGATGATGC AGCCCATTCC GGGCGGCGCG ACGGCGCGAC CGTTCGTGAC CCATCACAAC
GCCCTGGGCG CGGACCTGTA CCTGCGGGTG GCGCCGGAGC TGTACCTGAA GCGGCTGGTG
GTGGGCGGCT TCGAACAGGT CTACGAGATC AACCGGAATT TCCGTAACGA GGGGGTGAGC
ACCCGCCACA ACCCCGAGTT CACGATGCTG GAGTTCTATC AGGCCTACGC GGATCACAAC
GACCTGATGG ACCTGACCGA GGCTATGCTG CGGCGGCTGG CCGAGGAGCA GTTGGGCACG
ACGCAGATCA CCTATCAGGG TGAAACCTTC GACTTCGGCC GGCCCTTCCG GCGGATCCGG
ATGGTGGATG CGATCTGTGA GTTCAACCCC GACATCGGGC CGGAGGCGCT GACCGACCGG
GATTCGGCGC TCAACCTGGC TGGGCACCTG AACATCCCGC TGATGGGGCA TGAGGGGCTC
GGCAAGCTGC AGATGGTGAT CTTCGAGACC ACGACGGAGC ACAAGCTGCG CGAGCCCACG
TTCGTGACCC ACTACCCCAA GGAGGTCTCG CCGCTGGCCC GGCCGGTGGA TGACGACCCC
TTCTACACCG AGCGGTTCGA GCTGATCGTC GGCGGCCGGG AGATTGCCAA TGGCTTCTCC
GAGCTGAACG ACGCCGAGGA CCAGGCGGAG CGGTTCCGGG CCCAGGCGGC GGAAAAGGCC
GCCGGCGATG ACGAGGCGAT GCACTACGAC GCCGATTTCA TCCGGGCGCT GGAGTACGGG
TTACCCCCCA CGGCGGGCGA GGGCATCGGC ATCGACCGGC TGGTGATGCT CTTCGCCGAC
GCCCCATCCA TCCGGGACGT CCTGCTGTTC CCGGCCATGC GCCCGGAGAC GGGGGAGTAA
 
Protein sequence
MTMTEQDDNK LIAQRREKLA ALREAGQAFP NDFRRDSLAA DIHARCAELD DEALEAENIR 
VRVAGRMMAK RVMGKASFTH LQDQSGRIQL FLARDELPEG VYQQFKGWDV GDIIGAAGTL
FRTRKGELSV KVDELRLLTK SLRPLPEKYH GLTDTEARYR QRYVDLIMND DSRRVFMLRS
RLVAGIRDFL NGRGFLEVET PMMQPIPGGA TARPFVTHHN ALGADLYLRV APELYLKRLV
VGGFEQVYEI NRNFRNEGVS TRHNPEFTML EFYQAYADHN DLMDLTEAML RRLAEEQLGT
TQITYQGETF DFGRPFRRIR MVDAICEFNP DIGPEALTDR DSALNLAGHL NIPLMGHEGL
GKLQMVIFET TTEHKLREPT FVTHYPKEVS PLARPVDDDP FYTERFELIV GGREIANGFS
ELNDAEDQAE RFRAQAAEKA AGDDEAMHYD ADFIRALEYG LPPTAGEGIG IDRLVMLFAD
APSIRDVLLF PAMRPETGE