Gene Mlg_2004 details

Gene Information

Locus tag: Mlg_2004
Symbol:
ID: 4270478
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2272692
End bp: 2274560
Gene Length: 1869 bp
Protein Length: 622 aa
Translation table: 11
GC content: 68%
IMG OID: 638126760
Product: cyclic nucleotide-binding protein
Protein accession: YP_742836
Protein GI: 114321153
COG category: [T] Signal transduction mechanisms
COG ID: [COG2905] Predicted signal-transduction protein containing cAMP-binding and CBS domains
TIGRFAM ID:
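
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the record states neither assumption explicitly, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2272692, 2274560)
print(length)                     # 1869
print(protein_length_aa(length))  # 622
```

Both values agree with the Gene Length and Protein Length fields of the record.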


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.429129
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGATCG AACGCCTGGC CTCGGCAGAG GCCTTCGGTG CCGTGGACAA CGAAGCCTTG 
GGCGCCCTGC TGGCTCACGG CACCGAACAC CGCCTAGAGG ATGACGAGGT CATCTTCCGC
ATCGGGGAGC CGGATAAGAA TATCCTGTAC ATTGTCTACG ACGGTCGCGT CACCCTGACC
GGCACCGACG GGAGTAGCAC CCTGTACGAG GCCGTGGCGC TGCTCGGTCT CTCCAGCTAC
TTCGACGACG AGCCCTACTC CCTGACCGCC CGGGCCGCGG GGCCCTGCCA GCTCATCGAG
GTACCGTTCA GCGCTGTACG GTGCCTGGAG CGGGAGCACC CTGCCCTCGC CGACGCCTTG
AGTCACATCA TTGCCGACCG CATCCGCGCC GGGACCGCCC AGTGGCGCAC CGGGGCGGTG
GGCGCCCTTT CACAGCCCGT CCGCTGGGCC ATGACCGCGC CCCTGGTCTG CCTGCCAGGT
GGCGCCAGCC TGACTGAGGC GTACCAGCTC ATGGCCGCCC GCGGCATCGG CAGCCTGGGG
GTGGTGGACG GGCAGGACCG GCTGATCGGG CTGGCCACCA TCAAATCCCT CGCCCACCGG
CTGCTCATCG AACGCCAGAG CCCGGATGCG CAGCTGTCCA CCGCCGCCAT GCCGGCCAGG
CGCATTTCGC CGGATGCCGC CCTGTGGCAG GCGGAGGAGA TCCAGGCACG GGAGGCGCTG
AAGTACCTGG TGGTGGCGGA AAACGACCGC CCCCTGGGCA TGCTCTCGCA GAGCAACCTC
ATCGAGACCA TCCGCGCCCA CCAGACCCTG TTGCGTGACC GGGTGGAGCG TACGGGCACC
CTGGCCGACC TGCAGGCCCT CGCCGCCGAG TTACCCAGCG TCGCCCGCCA GGCGCGTGAG
GCCAACCGCG AGGCCAGCCG GGCGGTAACC CAGCTCAGTG AATTCCACCT GCAGCTCCAG
CGACGCTGCG TCGAGATACT CCTCACGGAG TTGGCATCGG AAGGTCTCGG CCCCCCGCCC
CGGCGCTATG CCCTGCTGGT CATGGGATCG CTGGCCCGGC GGGAGTCGCT GCTCAATCCG
GATCAGGACA ACGCCCTGAT CATCGCCGAT AACCGGCCTG GCGAGACGGA GCCGAGCCCC
CTCGACGACC GCGAGCGGCG TTGGTTCGAC ACCTTTGCTG ATCGCCTGAA TCACCGGCTG
GACGCCCTGG GCTACGACTG GTGCCAGGGT CACATCATGG CCCGCACGCC GGTATGGCAC
CGTCAACTAG GTCAGTGGCG GCAATGGGTC ATGACGGCGA CACGACACCC GAACGGAGAC
CGGGCCCGCT GGGCCAATAT CTTCCTGGAC TTTGAACTGC TGCAGGGCGA TGCCGGCCTG
GTGGATGCGC TCTGGCGGGC GGTTCTGGAG GCCTTTGCGC AGCACCGAAA GCTGCTGCGG
TTCATGGCCG CCGACGATGC CGAGGGCACA CCGGCGCTCG GGCTGTTCAA CCGGTTGGTC
ACCTCCGAGC GCGAAGAGGC CCGGGGTCGG GTCGATATCA AACGCAACGG CATGCGCATC
CTGGCCAATG GCTGCCGCAT TTACGCCCTT GGCCACCAGG TGCGGGCGAC CGGCACCCTG
GCCCGCTTGC AGGCCCTGCG CCACCAAGGC GTGCTCACGC CGGACAAGGT GGATTCGTTG
GTGGCCGCCC AGGAGGCCCT GCTGGGCCTG CTACTGGACC ACCAGCTTCG GCAGTGGGGC
GAGGGCCGGC GCCCGGACAA GTACATCGCC CCGGACGCGC TGGACGAGAT GCAGCGCCAG
GCGTTGGTCA CCAGCCTGCG CGCCATTCGC CGGTTCCAGG ACCGGGTGCA AGGCGTCTTC
GGCCTTTAG
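
The GC content field in the record can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text in the 10-base grouped layout shown above; `gc_fraction` is a hypothetical helper name, and the example uses only the first 20 bases rather than the full gene.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base grouping and line breaks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base groups of the gene sequence, for illustration only.
example = "ATGGAGATCG AACGCCTGGC"
print(f"{gc_fraction(example):.0%}")  # 60%
```

Passing the complete gene sequence to the same function would give the value to compare against the GC content field.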
 
Protein sequence
MEIERLASAE AFGAVDNEAL GALLAHGTEH RLEDDEVIFR IGEPDKNILY IVYDGRVTLT 
GTDGSSTLYE AVALLGLSSY FDDEPYSLTA RAAGPCQLIE VPFSAVRCLE REHPALADAL
SHIIADRIRA GTAQWRTGAV GALSQPVRWA MTAPLVCLPG GASLTEAYQL MAARGIGSLG
VVDGQDRLIG LATIKSLAHR LLIERQSPDA QLSTAAMPAR RISPDAALWQ AEEIQAREAL
KYLVVAENDR PLGMLSQSNL IETIRAHQTL LRDRVERTGT LADLQALAAE LPSVARQARE
ANREASRAVT QLSEFHLQLQ RRCVEILLTE LASEGLGPPP RRYALLVMGS LARRESLLNP
DQDNALIIAD NRPGETEPSP LDDRERRWFD TFADRLNHRL DALGYDWCQG HIMARTPVWH
RQLGQWRQWV MTATRHPNGD RARWANIFLD FELLQGDAGL VDALWRAVLE AFAQHRKLLR
FMAADDAEGT PALGLFNRLV TSEREEARGR VDIKRNGMRI LANGCRIYAL GHQVRATGTL
ARLQALRHQG VLTPDKVDSL VAAQEALLGL LLDHQLRQWG EGRRPDKYIA PDALDEMQRQ
ALVTSLRAIR RFQDRVQGVF GL
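
The protein sequence corresponds to a translation of the gene sequence under translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code and differs from it in the set of permitted start codons. The sketch below builds that codon table by hand and translates a sequence up to the first stop codon. It is an illustration under simplifying assumptions, not the database's own pipeline: it does not handle ambiguous bases, and it does not convert alternative start codons to methionine (not needed here, since the gene starts with ATG).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(bases), 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 9 bases of the gene sequence above.
print(translate("ATGGAGATCG AACGCCTGGC CTCGGCAGAG"))  # MEIERLASAE
print(translate("GGCCTTTAG"))                         # GL
```

The two fragments reproduce the first ten residues (MEIERLASAE) and the last two residues (GL) of the protein sequence listed above.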