Gene Mlg_1506 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1506 
Symbol 
ID4269863 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1719102 
End bp1720547 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content71% 
IMG OID638126264 
Producthypothetical protein 
Protein accessionYP_742345 
Protein GI114320662 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.107758 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.907122 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAACA AAACACCAAA ACGCCTGCAG GACTGGGTGG AATGGCTGGG CGAGCGCCCC 
CTGCCCCGCC ACCCTGCGCT GGCACCGGAC TTCGGTGCCG AGAGCGCCCT TTTGACCTGG
CAGCTGCAAC AAGACCCGGC AGCGGTCGTG GGCCTGTTGC GCCGTGCGGC GGCAGTGCGC
CACCGTCACC TGGACACGCG CCTGGAGGGC ACCGAGGAGG CCCTGATCAT GCTCGGGCGC
AACGGCGTGG CGGCCTGCTG GGAGGCCCTG CCACCCGCGG ACCGGCTGTT ACACGGTGAG
GCCTTGACCC GCTATCTGCG CTGCCATGCG CGGGCGGTTC ACGCGGCCCG CCAAGCGGAG
GAGTGGGTGC GGTTACGCCA CGACCGGCGG CCCAGCGAAG TGGCCGACGC CACCCTGATA
CGCCACATGG GCGAGCTGAT GTTGCGGGCC CATGCCCCGG AGCGGATGGC CGAGGTGGAC
GCGCTGGCCA CGGACGCCCG ACTGGACGCC GACGCGGAGA CAGCGGTGCT CGGCTTCACC
CTGCAGGACC TGGCCATCAG TCTGGGTAAT CATTGGCGGC TGCCCTACCT GGCGCTGGAA
GACCTGAGCG GCATCCGCCC CCTCTCCCAG CGCAGCCAGG CGGTGTTGCT GGCCCTGCGC
CTGGCCCGGG TGGCGGAGGA CCCGCGGGCG GCCCGGGAGC TGCCGGTGCT GATCCAGGCG
CTGTCGCACT ACATGGGCGA CAGCGAGGAC CACGCCCGCC GGGTGACCCT GGAGACGGCA
CAGGTCATCC ACGAGCAGAC CCCACCCCCC ACCGGCTGGT CGCCGACACT GGCACTGGAC
GGGGACCGCG GGCCGTCGCC ATCCACCCCG GCCCCTTTCT GCCTGGCCCC ACGGGCGGAC
ATCCGCGCCC GGGTGGCAGC GGAATTGGAG CGGGAGGACT TTGACCGGGC ACGCCTGGAG
CTGCTCACCC GCCACCGCCT GGACAACCGC GAGGCGGTGC TTATCAGCCT GGTGCTGACC
GGGCTGCATG AGGGGCTGGG ATTGAACCGG ACACTCTTCC TGCGGGCGCC ACGCCGCGGT
GAGCATCTGC AGTTGTTCTT GCAACGCGGC GCGCTGGGGG ACCCACTGCT GCACGAGATG
ACCGTAACCC CGGCCCACAG CCCCCTGCTG CGTGAGGTGA TCGATCAGGC CCCCGCCTAC
CGGCTGTGCC AGCCGACTCA GGGCAGGGAC CGGCTACCGG AGCCGTTACA GCGCTTCAAC
GGCGGCCAGC CCTGTCTGTT GGCCAGCCTC CGCGTCAACG ACCGGTTGGC CGGGCTGTTC
TACGCCGACC GCCACCTGGC GGGGTGTGGC CTGGATGGGA CGGCGGCCAT GGGCTTCCGG
CATTTCTGCG AACAGGCCGG CCGGCGATTG CAACGTCTTG CGGACGAGCA CCACCACCTG
AGCTAA
 
Protein sequence
MSNKTPKRLQ DWVEWLGERP LPRHPALAPD FGAESALLTW QLQQDPAAVV GLLRRAAAVR 
HRHLDTRLEG TEEALIMLGR NGVAACWEAL PPADRLLHGE ALTRYLRCHA RAVHAARQAE
EWVRLRHDRR PSEVADATLI RHMGELMLRA HAPERMAEVD ALATDARLDA DAETAVLGFT
LQDLAISLGN HWRLPYLALE DLSGIRPLSQ RSQAVLLALR LARVAEDPRA ARELPVLIQA
LSHYMGDSED HARRVTLETA QVIHEQTPPP TGWSPTLALD GDRGPSPSTP APFCLAPRAD
IRARVAAELE REDFDRARLE LLTRHRLDNR EAVLISLVLT GLHEGLGLNR TLFLRAPRRG
EHLQLFLQRG ALGDPLLHEM TVTPAHSPLL REVIDQAPAY RLCQPTQGRD RLPEPLQRFN
GGQPCLLASL RVNDRLAGLF YADRHLAGCG LDGTAAMGFR HFCEQAGRRL QRLADEHHHL
S