Gene Mlg_1061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1061 
Symbol 
ID4268982 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1239341 
End bp1240735 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content66% 
IMG OID638125812 
Productphage integrase family protein 
Protein accessionYP_741903 
Protein GI114320220 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.69421 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.669577 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCGCA AGGGTATTAC GGCCAAGAAA CTGGAGGCGC TCCAGGGCAA GCGCCGCAAG 
TCGGCCACCA GGGAGTGGGA CGGTGACGGT TCAGGGTTCG GGGTGAAGGT GTCGGCAGCC
GGGCGGCTGA CATTCTTTCA GTTCTACTAC TCCCCCGAGG GCACCACCGA TAAAGACGGT
AACGACATTA CCGGCAAGCG GCGGTTCATG GGGCTGGGTA ACTACCCTGA GACAACGCTC
GCAGAGGCGC GGGAGAAGGC GCAGGAGGCG CGAGAGTTGC TAGAGCGGGG CATTGATCCG
CAGGAGCACG CCCGAGAGCA GCAGGAGGCG CACAGGCGGG AGAAACGCAA GCGGGCGCAA
CGGGGCACGC TGGCAGGGGT GGCGGCGCTG TACCTGTGGC ACATGCGCAA GCGCGGGCGC
TCCAGGGAGT ACATTACCGC CGTTCGCCGT GGGTTCCACC GCGACGTGTT CCCGGTGGTG
CCCCGTGACA CCAAGGCCGG GGACGTGGAG CCGGAGGACG TGCAACTGAT CCTGCACCGG
CCCCTAAGCC GTGGCGCTGA CCATACCGCC CGAGTGCTGC GGGCTAACCT GCATCGCGCG
TTCAAGTTGG CCATTCAGGC GGATAACGAC CCGCGCAACC TGGGCAGTGC CGTTAAGTTC
CGCGTGCGCC ACAACCCGGT GGAGGACGTG CCGCTTGAGG TGCACGTTAC GCCCGGAGAT
CGGGAGCTTT CATTTAGCGA GATCGGGCGC GTATGGCGGG AGGCCGACCA CGCGACCCCT
TACCCGCAGG ATGCGTTGCT GTTGCGCCTG CTGCTGGCGC TGGGCGGCCA GCACATTACC
GAGCTGAGGG AAGCACAGTG GCCCGAGTTC GATTTGCAGG CCGGACAGTG GCACCTGAAA
GCCGCCCGGC ACAAAAACCG CACCCGCGAC CACCTGGTAC CAATCAACAG CACCGCCGCC
GAAGTGCTGG AGGAATTGCG GGCGCTGACT GGTGGGCAAG GGTACCTGTT CCCGCAGCTA
CGCAACGCGC ACAAGCCCAT GCGAGCCGAG AGGCCGGGCG CTATCGTTCG CGGTCTGCTG
GCGCACCTGG AGGCGCAGGG CGAGCCGATG GAGAAGTTTA CGGCCTCAGA CTTCCGGCGC
ACCTGCAAAA CGCGGATGCA CGAGATAGGG ATTCCCAAAA CCACCACCAA CCACCTGCAC
AACCATGACT TTGGTGGCGT GAGCGCGAAG CACTACGACC GCTACGACTA TTGGGGCGAG
AAACAGCGGG CCATGCGGGC ATGGGATATT GCGCTGAAAG CCGCCATTGC AGGCGAGCCG
GTGCCCGAGG CGCGTTGCCG GGCGGCGCTC CAGTGGGACG AGAATGGCGG GGCGCGGCTG
GAGGTGGTGG GCTAG
 
Protein sequence
MARKGITAKK LEALQGKRRK SATREWDGDG SGFGVKVSAA GRLTFFQFYY SPEGTTDKDG 
NDITGKRRFM GLGNYPETTL AEAREKAQEA RELLERGIDP QEHAREQQEA HRREKRKRAQ
RGTLAGVAAL YLWHMRKRGR SREYITAVRR GFHRDVFPVV PRDTKAGDVE PEDVQLILHR
PLSRGADHTA RVLRANLHRA FKLAIQADND PRNLGSAVKF RVRHNPVEDV PLEVHVTPGD
RELSFSEIGR VWREADHATP YPQDALLLRL LLALGGQHIT ELREAQWPEF DLQAGQWHLK
AARHKNRTRD HLVPINSTAA EVLEELRALT GGQGYLFPQL RNAHKPMRAE RPGAIVRGLL
AHLEAQGEPM EKFTASDFRR TCKTRMHEIG IPKTTTNHLH NHDFGGVSAK HYDRYDYWGE
KQRAMRAWDI ALKAAIAGEP VPEARCRAAL QWDENGGARL EVVG