Gene Mlg_0911 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0911 
Symbol 
ID4269296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1029991 
End bp1031319 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content69% 
IMG OID638125663 
ProductN-ethylammeline chlorohydrolase 
Protein accessionYP_741755 
Protein GI114320072 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAAGG TCGACAGTCT GGTCTCCGCC CGCTGGGTGA TCCCGGTGGA GCCGGAGGAC 
GCCGTGCTGC CCCACCACAC GGTGGCCATC CGTGACGGGC GCATCATCGA CTGTCTGCCC
ACTGACCAGG CCGAACGGCA GTACGAGGCG GGCCAGCATC AGCGGCTGGC GCGGCACGCA
CTGCTTCCGG GTCTGGTCAA TGCCCACACC CACAACGCCA TGAGCCTGCT GCGCGGGCTG
GCCGATGACC TGCCGCTGAT GACCTGGCTG CAGGATCACA TCTGGCCGGC GGAGGGGCGG
CATGTCTCGC CGGAGTTCGT GCACGACGGC ACGGCGCTGG CGATGGCCGA GATGCTACGC
GGCGGCACCA CCTGCTTTTC CGATATGTAC TTCTTCCCGG AGGTCACCGG CCGGCTGGCG
GACCGGGTGG GGATGCGCGC GGTGCTGGGG ATGATCGTGA TCGACATGCC CACCCCCTAC
GGTAGCGGGC CGGAGGACTA CCTGGACAAG GGCGTGGCCC TGCACGACGC CTGGCGCAAC
CACCCGCATA TCTCGACGGT GTTCGCCCCT CACGCACCCT ACACCGTCTC GCCGGAGTGG
CTGAAGCGGG TGCGGGTGCT GGCCGACCAA CTGGACACCC GGGTCCACAT GCATGTCCAC
GAGACCGCCG GCGAGGTGGA GGACTGCGTG CAGTCCACCG GTCAGCGGCC GCTGCAGCGG
CTCGACCAAC TCGGGCTGCT CAACCCTTCG CTGATCGCCG TGCACATGAC CCAGCTCACC
GAGGCGGAGA TGGACCGGCT GGCCGAAACC GGTGTCAACG TGGTCCACTG CCCGGAGTCC
AACCTGAAGC TGGGCAGCGG GTTCTGCCCG GTGCACGCCC TGCAGCGGCG CGGGATCCAC
GTGGCCATCG GCACCGACGG CGCGGCCAGC AACAACGACC TGGATCTGCT CGGCGAGCTG
CGCACCGCCG CGCTGTTGGC CAAGGGCTAC AGCGGCAACC CGGCGGCCCT GCCCGCCCAC
CGGGCATTGC GCATGGCCAC CCTGGACGGC GCCCGGGTGC TGGGCCTGGA CGGGGAGATC
GGCTCGCTGG TGCCGGGCAA GTACGCCGAC CTCTGCGCGG TGGATCTCTC CGGTGTGGAG
ACCGAACCGC TGTACAATCC TATCTCGCAG CTGGTCTACA CCGGCCAGCG GGAGCGCGTC
AGCCACGTCT GGGTGGCGGG CCGGCTGCTG CTCAACGAGC GCCGTCTGAC CACCCTGAAC
GAGGCCGATA TCCTGGAACG GACCCGGGCC TGGCAGGCCC GCATCGCCCC GGAGGACACC
CATGACTGA
 
Protein sequence
MQKVDSLVSA RWVIPVEPED AVLPHHTVAI RDGRIIDCLP TDQAERQYEA GQHQRLARHA 
LLPGLVNAHT HNAMSLLRGL ADDLPLMTWL QDHIWPAEGR HVSPEFVHDG TALAMAEMLR
GGTTCFSDMY FFPEVTGRLA DRVGMRAVLG MIVIDMPTPY GSGPEDYLDK GVALHDAWRN
HPHISTVFAP HAPYTVSPEW LKRVRVLADQ LDTRVHMHVH ETAGEVEDCV QSTGQRPLQR
LDQLGLLNPS LIAVHMTQLT EAEMDRLAET GVNVVHCPES NLKLGSGFCP VHALQRRGIH
VAIGTDGAAS NNDLDLLGEL RTAALLAKGY SGNPAALPAH RALRMATLDG ARVLGLDGEI
GSLVPGKYAD LCAVDLSGVE TEPLYNPISQ LVYTGQRERV SHVWVAGRLL LNERRLTTLN
EADILERTRA WQARIAPEDT HD