Gene Mlg_0789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0789 
Symbol 
ID4270552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp878948 
End bp880282 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content63% 
IMG OID638125539 
Productphage integrase family protein 
Protein accessionYP_741633 
Protein GI114319950 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGCCA AAACACTGGA CACCAGCATT ACTGATGCCG CCACCCGGCG CTTCCTGAAG 
AATGAGCCGG AGCGGGCAAC CTTATGGTGT GAGCGGATTA CCGGGTTTCA CATACTCAAG
GTAAAGACCG GGGGCAGTTG GCGGTATCGG TATCAGACCG CAGACGGTAA GCGCCGCGTG
GCCACCATCG GGCGCTATCC GGCCATGAAA CCGCAGCAGG CCGCCGAGAA GGCACTAGCC
TGGCGCAATG AGAACGTGGA CGCGCTGGCA GTAAAGGCCA AACGCACCCG CAATGCCTTG
ACCGAAGCGC AGAAGGCCGA GCAACGCACG CTGCGGGCCT ATCTCGAAGG GCCGTACGCT
AGGTACCAGG AGCGCAAGCG CAGCGGCGCG GAAACCCTGC GCCTCATTCG GCACAACTTC
ACCGACTGGC TAGACCGGGA CATGGCCACG TTGGACAGCG CAGACGTGAA GGCGTGGCAG
GCCAAGCGTG AAGCCGAGGG CCGGGCATAC TCCACGCTGC AACGCGCCTA TGGTGCGCTG
AAAACGGCCC TGAACCGCGC CGCGAAACAT GACGACCTGC TAGACGCGAA CCCCCTGCGA
GATGTTCACC TTGAGGCCCC ACGGTTTGAC GAGGTGGACA AGCAGGTAAA AGCGGAGCAG
ACCGCCCGCC GCCTGCTGAG TGACGAGGAA GTGACCGGGC TGCATGAGGG CTTGCGAGCC
TTTGCGGATG AGTTGCGCCG CCAGCGCCGT AACAGCCGAG CGCATGGCAA GCCGGATCTG
CCCGACCTGG ACGCGGTGGA CTACCCGCAT TGGTTTATTC CTTTCTGCCA CGTTGCCCTT
TATACCGGAC TGCGACCGGG TGACTTGTAC GCGCTCACCT GGGCGGAGCT GAACCCCACG
TTCGGGCGGC TGAACGTGAT TCCTCAGAAA ACCCGTCACC ACCGCAACCC CGCCCGCGTG
ACAATGGATT TAGTGCCGGA GTTGCTGGAG GTGGTGCGGG CCTGGTGGGC GCAGCAGGGC
AAGCCTGACA CCGGGCTAGT GTTCCCCTCA CCCGTTACCG GCTTGCGGAT GGACAAGAAA
GCCCACCTGA AAGCCTGGCG ACGTGTGAAG CGCCTTGGTG GGCTGCCCGA CGATCTGGAC
TTCTACACCT TGCGGCACAA CTTTATTTCC AAGCTGGTGG CCGCTGGCGT GCCCCTGCTG
ACAGTGGCGC AACTGGTGGG CCACAAATCG GCGGGGATGA TCGAACAACA TTACGGCGAC
CTGTGCCCCA CCGTAGCGGC TGATGCGTTG GCCGTGTTTT CCAAGTCTGT GACACCGAAG
CAGGAGGCGG TATGA
 
Protein sequence
MPAKTLDTSI TDAATRRFLK NEPERATLWC ERITGFHILK VKTGGSWRYR YQTADGKRRV 
ATIGRYPAMK PQQAAEKALA WRNENVDALA VKAKRTRNAL TEAQKAEQRT LRAYLEGPYA
RYQERKRSGA ETLRLIRHNF TDWLDRDMAT LDSADVKAWQ AKREAEGRAY STLQRAYGAL
KTALNRAAKH DDLLDANPLR DVHLEAPRFD EVDKQVKAEQ TARRLLSDEE VTGLHEGLRA
FADELRRQRR NSRAHGKPDL PDLDAVDYPH WFIPFCHVAL YTGLRPGDLY ALTWAELNPT
FGRLNVIPQK TRHHRNPARV TMDLVPELLE VVRAWWAQQG KPDTGLVFPS PVTGLRMDKK
AHLKAWRRVK RLGGLPDDLD FYTLRHNFIS KLVAAGVPLL TVAQLVGHKS AGMIEQHYGD
LCPTVAADAL AVFSKSVTPK QEAV