Gene Mlg_2334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2334 
Symbol 
ID4269090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2644456 
End bp2646027 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content68% 
IMG OID638127092 
Productintegrase catalytic subunit 
Protein accessionYP_743164 
Protein GI114321481 
COG category[L] Replication, recombination and repair 
COG ID[COG4584] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACAC GGGCTGGCGA GGACGGGGTA TTGAAGGAGT GGGTCGTGAT ACACAAGATC 
AAGGCACTGT ACGACGAAGG CCGTGGGCTC TCAGTTCGGG CCATCAGCCG GGAGCTGGGC
ATCTCGCGCA ACACGGTGCG CAAGTACCTG CGGGCGGACA CCGAAGCGGT CGCAGCGGAG
CGGGCCGATG GGCGCCGGGG CCGGCTGCTG GATGAGCACC GGGCTTACAT GGAGTATTTG
CTGCGCCGCT ACCCGCAGCT CAGCGCCGTG AAGGTGGCGC GCAAGCTCCG GGACAAGGTC
GGTGACCTGG CGGTCTCGGA TCGCAGTCTC CGGCGGTATC TGCAGGAGCT GCGCGCCAGC
GTCCAAGTGG CCCAGCCGCG CTACTACGAG CCGGTGCTGG ACGTGGTGCC GGGCGTACAG
TGCCAGGTGG ACCCCGGTGA GTTGCGGGGC GTGGCCATCA GCGGCGTGGA GCGCACGGTC
TACTTCGTGG TCTTCGTGCT CTCGTTCTCG CGGCTGATGC ACGTTGCGGT GGCCTTCCGG
CCCATCGACA CGGCGCTGTT CATCCGCATG CATGATGAGG CGCTGCGGGC CTTTGGCGGT
ACCCCGGAGG AGTGCGTCTA CGACCAGACG AAGATGGTGG TCATCGCCGA GCAGTTCCGG
GAGCTGACGG TCAACGAGCG CTTCCATGAG TACGCCACCG GTGCGGGCTT TCGCATCCAT
GCCTGCCGGG GGTACGACCC GGAGAGCAAG GGCAAGGTGG AGGCCGGGGT GAAGTACGTT
AAGCGCGATT GCCTGTACGG GGAGCGCTTT GCCGACGAGG CAGACGTCCG CGCCCACGTC
CAGCAGTGGC TCGACCAAGT GGCCAATGTC CGCCGCCACG GCACCACCGG GCGTGAGCCC
CGGGGGCACT TTGAGGCTGA AGAGCGGGCG CACCTACGGG CCTACCTCAC CCCCTCGTGC
TTGACCCAGG CGGCTGCGGC GCGCCAGACC CGCAAGGTGG ACAAGACCGG GCTGATCGCC
TGGCACTCGA ACAAGTATTC GGTACCCATG CGCTACCAGC GTGGCCGGGT GGGCGTGCAG
GCCGACGAGA CCCAGCTTCA CATCCTCGAC CTGGAAAGCG GTGAGATCGT GGCCACCCAT
ACACTGGCCA CGGGCAAGGG CCAGACGGTG CGTAACACCG ACCACTACCG GGATCGCCGA
CAGCAGATCG AGACCCTGGA GGCCGCCATT GGCGAACGCG TGGGCGAGCA GACCGGAGCC
CGGCTGTGTG CCCGGTTGCG GGCCAGCAAC CCGCGGATCT ATCGCGACCA GGTGGCCGCC
GTACACGCCC TGCTGGAGAG CGGGCCGCCC CCGGCACCCG GACTGGTCGA GGACCTGGCC
GGGCGCGAAG GGATGACCGC CACCCGCTTC AAGGCCCAAC TGCAGGCGGC ACACCGGGCC
CAGGAGCGGG GCCGGGACCT CGAAGCGGAT GCCGACGAGC CCGCCGTGGA CGCGCAGGCA
CTGGCCCTGT CGGCCTACGC CCATCTTGGC CAGTCGGCCG GCCAAGAGGA GTTGACCCAT
GAGCCTGCTT GA
 
Protein sequence
MATRAGEDGV LKEWVVIHKI KALYDEGRGL SVRAISRELG ISRNTVRKYL RADTEAVAAE 
RADGRRGRLL DEHRAYMEYL LRRYPQLSAV KVARKLRDKV GDLAVSDRSL RRYLQELRAS
VQVAQPRYYE PVLDVVPGVQ CQVDPGELRG VAISGVERTV YFVVFVLSFS RLMHVAVAFR
PIDTALFIRM HDEALRAFGG TPEECVYDQT KMVVIAEQFR ELTVNERFHE YATGAGFRIH
ACRGYDPESK GKVEAGVKYV KRDCLYGERF ADEADVRAHV QQWLDQVANV RRHGTTGREP
RGHFEAEERA HLRAYLTPSC LTQAAAARQT RKVDKTGLIA WHSNKYSVPM RYQRGRVGVQ
ADETQLHILD LESGEIVATH TLATGKGQTV RNTDHYRDRR QQIETLEAAI GERVGEQTGA
RLCARLRASN PRIYRDQVAA VHALLESGPP PAPGLVEDLA GREGMTATRF KAQLQAAHRA
QERGRDLEAD ADEPAVDAQA LALSAYAHLG QSAGQEELTH EPA