Gene Mlg_2780 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2780 
Symbol 
ID4269714 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp3161183 
End bp3162343 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content70% 
IMG OID638127542 
Producttype I phosphodiesterase/nucleotide pyrophosphatase 
Protein accessionYP_743610 
Protein GI114321927 
COG category[R] General function prediction only 
COG ID[COG1524] Uncharacterized proteins of the AP superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCTTGC CCGATTATCG CGGCGGCAGC ATCGTCAATC TGATGGCCAG CATCAGCCGG 
GGCCTGGGGG CGCCGCGGAT TGGCATCCCG GAGGCGCGAC TGCTGCCGGC GCATCAGATC
GGCCGGGCCC GCCACGTGGT TTTGCTGATG CTGGACGGGC TGGGCTATGA GTATCTGGCC
GGGCATCGCG ATGCCTTGCT GCGCCAACAC CTGCTGGGCG AACTCACCTC GGTCTTCCCC
TCCGCCACCG CCATCGCCGT GACCAGCTTT GCCACCGGCT TGACCCCGCG CCAGCACGGG
GTCACCGGCT GGCACATGTA CCTGGAGGAG CTGGGCCGGG TCTGCACCAT ACTGCCCTTC
CGCGACCGGG TCACCCGTGA GTCCATCGCC GAGGTGGATC CGGACGCCGT GCGGGTGCTG
GAGCAGCCGC CGCTGGCCAA TCTGCTCGAC GCTGAGACCC ACCTGGTGAT GCCGGCGGAG
ATCGCCGACT CCACCTACAA CCTGGCCACC GGGGGGCTGG CCTGGCGGCA CGGGGTGGCG
GACCTGGCGG ACTATGTCGA CACGGTGGCG GGGCTGGTGC AGTCGGCGGG CGGGCGCCAG
TACATCTATG CCTATTGGCC CCGCCTGGAC AGCCTGGCCC ACCAATACGG GATGGCCAGC
ACCGAGGTCC AGGCCCACTT CGCCGCCCTG GACGCCGCCT TCGACGAGCT GCGCCGGGAG
CTGGCCGGCA CCGACACCCT GTTGTTGGTC ACCGCCGACC ACGGGCTTAT CGACATCACC
CCGGACGGGG TGCTGGAGGT GGCCGACCAT CCCGCGCTGG AGGAGACCCT GGCGCTGCCC
ATCTGCGGTG AGCCCCGGGC CGCCTATTGC TATGTCCGCC CGGGGCGGGA GGAGGACTTC
CTCAACTACG TGCAGGGGCC GCTTGCGGGC TGGTGTGATG TCCACACCCC TGGGGAATTG
CTGCAGGCGG GGTGGCTGGG GCCTGGCCCG GCCCACCCCC GGCTGTCGGG GCGGTTGGGC
GACTATGTGC TGGTGATGCG TGACAACCGG GTGATCCACC AGCGGCTCAG CGGCGATGAG
CCATTCTCCC AGATCGGGGT GCATGGCGGC ACCAGCGGCG CGGAGATGCG AGTGCCGCTG
ATGGCCGCAC ACTGCGTTTG A
 
Protein sequence
MILPDYRGGS IVNLMASISR GLGAPRIGIP EARLLPAHQI GRARHVVLLM LDGLGYEYLA 
GHRDALLRQH LLGELTSVFP SATAIAVTSF ATGLTPRQHG VTGWHMYLEE LGRVCTILPF
RDRVTRESIA EVDPDAVRVL EQPPLANLLD AETHLVMPAE IADSTYNLAT GGLAWRHGVA
DLADYVDTVA GLVQSAGGRQ YIYAYWPRLD SLAHQYGMAS TEVQAHFAAL DAAFDELRRE
LAGTDTLLLV TADHGLIDIT PDGVLEVADH PALEETLALP ICGEPRAAYC YVRPGREEDF
LNYVQGPLAG WCDVHTPGEL LQAGWLGPGP AHPRLSGRLG DYVLVMRDNR VIHQRLSGDE
PFSQIGVHGG TSGAEMRVPL MAAHCV