Gene Mlg_2049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2049 
Symbol 
ID4270183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2321784 
End bp2322905 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content74% 
IMG OID638126805 
Productprotein of unknown function DUF294, nucleotidyltransferase putative 
Protein accessionYP_742881 
Protein GI114321198 
COG category[T] Signal transduction mechanisms 
COG ID[COG2905] Predicted signal-transduction protein containing cAMP-binding and CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.82123 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.388893 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATGAGA CGCAGGACCA AGGGGCTCCG GGCGCCAGTG AATCAGGTGC CGGTAGCGGG 
GCAACGGCAG GGACCGGTGC GGAGGGCACG GGTTTGGGGC GGGCTCTGGA GCAGGCATCG
GACCTGGCGG CCATCCAGGC CGTTGCGGGC CGCTTCCCCG AGGGTCAGGC CGCGCTGCAG
CGCTCCGGCA GGGACGCGGT GACCCAGGCC CGGATCGTCA GCGGTTGGGT GGACGCCCTG
ACCCGGCGGT TGATTGTCCT CGCGGAGGCG GAAATGGGGA CGCCGGCGCC GGGGCCGTGG
GCCTGGCTCG CGTGCGGCTC CCAGGGGCGG GGTGAGCAGA CCGTGCACAC GGATCAGGAT
AATGCCCTGG TCTATGGGGA TGATCTCCCG CCGGGGGCCG ACGACTGGTA CCGGCGCCTG
GCGGGGCGGG TGACGGAAGG GCTGGCGGCC TGTGGCTTGC CGCATTGCCC GGGCGGCGTC
AGTCCGGCGA ATGGTGATTG GCGGCGCAGC GTCGGCAGTT GGCGGCGGGC GCTGCTCACA
GTCATTGAGG CCCCGGGACG CAAGGCGGTG ATGCTGGCCA CCCATTACCT GGATCTGCGG
GTGGTGGCCG GCGATCCGGC CCTGTTTGAG CCCGTACGCC GGGAGGCCTT GGAGCGGGCG
GCGAGCAACC GCCGGTTCCT CGCCCGGCTG AGCGATGGGG CGACGCGGCC GCGCCCACCG
TGGCATGGCC TGGGGCGGGT CTGGACCCCC TGGTGGGGTG CGAATGCGGG CCGGGTGGAT
CTGAAACAGG GCGGGCTGCT GCCGTTGGTG CAGTTGGCCC GGGTCTATGC CGTGCGCGCG
GGCCTGCCCG CCCACCATAC CCTGGAGCGC CTGCAGCAGG CCGCCGGGGC GGGCACGCTG
GACCGCGGTG AGGCCGGCGC GCTGATACAG GGCTACCAGG TTGTCGCCGG TATCCGCGCC
CGCCTGCACG CCGAGGCCAT CGACGCCGGC CGCCCGCTGC ACAACCAGGT CCCGGTGGCG
GCACTGAGCC GGGGGGAGTA CGCGGCGTTG CGCGCCGCCT TTCGCAGTAT CCTGCGCCGC
CAGCGCGCAT TGCGCCGGGC CGTGGTGCGC GAGGGGCTAT AG
 
Protein sequence
MDETQDQGAP GASESGAGSG ATAGTGAEGT GLGRALEQAS DLAAIQAVAG RFPEGQAALQ 
RSGRDAVTQA RIVSGWVDAL TRRLIVLAEA EMGTPAPGPW AWLACGSQGR GEQTVHTDQD
NALVYGDDLP PGADDWYRRL AGRVTEGLAA CGLPHCPGGV SPANGDWRRS VGSWRRALLT
VIEAPGRKAV MLATHYLDLR VVAGDPALFE PVRREALERA ASNRRFLARL SDGATRPRPP
WHGLGRVWTP WWGANAGRVD LKQGGLLPLV QLARVYAVRA GLPAHHTLER LQQAAGAGTL
DRGEAGALIQ GYQVVAGIRA RLHAEAIDAG RPLHNQVPVA ALSRGEYAAL RAAFRSILRR
QRALRRAVVR EGL