Gene M446_1651 details

Gene Information

Locus tag: M446_1651
Symbol:
ID: 6128922
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 1845763
End bp: 1847442
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 72%
IMG OID: 641641909
Product: urocanate hydratase
Protein accession: YP_001768578
Protein GI: 170739923
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2987] Urocanate hydratase
TIGRFAM ID: [TIGR01228] urocanate hydratase
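
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; the record itself does not state either convention. The function names are hypothetical and chosen for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1845763, 1847442)
print(length)                     # 1680
print(protein_length_aa(length))  # 559
```

Both results agree with the Gene Length and Protein Length fields listed for M446_1651.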


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.506607
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.280381
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGCC TCGACGATCC CCGCCTCGAC AATGCCCGCC TCGTCCGCGC CCCGCGCGGC 
CCCGCCCTCA CGGCGCGGAG CTGGCTCACC GAGGCGCCCC TGCGCATGCT GATGAACAAC
CTCGACCCGG AGGTCGCCGA GCGGCCGGGC GAGCTCGTGG TCTATGGCGG CATTGGGCGG
GCGGCCCGGG ACTGGGCGAG CTTCGACCGG ATCGTCGCCG CCCTCACGGA CCTGGAGGAG
GATCAGACGC TGCTGGTCCA GTCCGGCAAG CCGGTCGGGA TCTTCCGCAC CCACGCGGAC
GCGCCCCGCG TGCTGATCGC CAACTCCAAC CTCGTGCCGC ACTGGGCGAC CTGGGCGCAT
TTCCACGAGC TCGATCGCCG GGGCCTGATG ATGTACGGCC AGATGACGGC CGGCTCCTGG
ATCTACATCG GCAGCCAGGG GATCGTGCAG GGCACCTACG AGACCTTCGT GGAGATGGGC
CGCCAGCACC ATGGCGGCGA CCTCGCCGGG CGCTGGATCC TGACGGCGGG CCTGGGCGGC
ATGGGCGGGG CGCAGCCCCT CGCCGCCACG ATGGCGGGCG CCTCCTGCCT CGCCGTCGAG
TGCCGGGCCT CCAGCATCGA GTTCCGCCTT CGCACGGGCT ACGTCGACGT GCAGGCCCGG
GACCTCGACG AGGCGCTCGC CCTGATCGAG GAATCCTGCC GGGCGCGCAC GCCCCGCTCG
GTCGCGCTCC TGGGCAACGC CGCCGAAATC TACGCCGAGA TCTGGCGCCG GGGCGTGCGG
CCGGATTGCG TCACCGACCA GACCTCCGCC CACGATCCCG TCAACGGCTA CCTGCCCCGG
GGCTGGAGCC TCGCCGAGTG GGAAACCCGC CGCGAGAGCG ACCCGGACGG GGTCGCGGCG
GCGGCCAAGC GCTCCATGGC CGAGCAGGTT CGGGTGATGC TGGACTTCCA CCGGGCCGGC
GTGCCCGTGG TCGATTACGG CAACAACATC CGGCAGATGG CGCTGGAGGA GGGCGTCCGC
GACGCCTTCG CCTTCCCGGG CTTCGTGCCG GCCTATATCC GCCCGCTGTT CTGCCGCGGC
ATCGGGCCGT TCCGCTGGTG CGCCCTCTCG GGCGATCCCG AGGACATCTA CCGGACCGAC
GCCAAGGTGA AGGAATTGCT GCCCGACAAC GCAGCCCTGC ACCGCTGGCT CGACATGGCG
CGGGAGCGGA TCCGCTTCCA GGGCCTCCCG GCCCGGATCT GCTGGGTCGG CCTCGGCGAC
CGCCACCGCC TCGGCCTCGC CTTCAACGCG ATGGTGCGCA GCGGCGAGCT GAAGGCCCCG
ATCGTGATCG GGCGCGACCA CCTCGATTCC GGTTCGGTCG CCTCGCCGAA CCGCGAGACC
GAGGCGATGC GCGACGGCTC GGACGCGGTC TCCGACTGGC CGCTCCTGAA CGCGCTCCTC
AACACCGCCT CGGGGGCGAC CTGGGTGTCC CTACACCACG GCGGCGGCGT CGGAATGGGC
TTCTCGCAGC ATGCCGGGAT GGTGATCGTC TGCGACGGCA GCGAGGCCGC CGACCGGCGC
CTCGCGCGGG TGCTCTGGAA CGATCCGGCG AGCGGCGTGA TGCGCCACGC CGATGCCGGC
TACCCGGACG CGGTCGCCTG CGCGCGGGAG CACGGTCTGA CCCTGCCGAG CCTCGGCTGA
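
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before any per-base statistic such as GC content is computed. The following is a minimal sketch of that step; the helper names are hypothetical, and the example runs on the first row of the sequence only, so its result is not expected to equal the whole-gene GC content field.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence as printed in this record.
first_row = "ATGAGCCGCC TCGACGATCC CCGCCTCGAC AATGCCCGCC TCGTCCGCGC CCCGCGCGGC"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` would give a value to compare against the GC content field.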
 
Protein sequence
MSRLDDPRLD NARLVRAPRG PALTARSWLT EAPLRMLMNN LDPEVAERPG ELVVYGGIGR 
AARDWASFDR IVAALTDLEE DQTLLVQSGK PVGIFRTHAD APRVLIANSN LVPHWATWAH
FHELDRRGLM MYGQMTAGSW IYIGSQGIVQ GTYETFVEMG RQHHGGDLAG RWILTAGLGG
MGGAQPLAAT MAGASCLAVE CRASSIEFRL RTGYVDVQAR DLDEALALIE ESCRARTPRS
VALLGNAAEI YAEIWRRGVR PDCVTDQTSA HDPVNGYLPR GWSLAEWETR RESDPDGVAA
AAKRSMAEQV RVMLDFHRAG VPVVDYGNNI RQMALEEGVR DAFAFPGFVP AYIRPLFCRG
IGPFRWCALS GDPEDIYRTD AKVKELLPDN AALHRWLDMA RERIRFQGLP ARICWVGLGD
RHRLGLAFNA MVRSGELKAP IVIGRDHLDS GSVASPNRET EAMRDGSDAV SDWPLLNALL
NTASGATWVS LHHGGGVGMG FSQHAGMVIV CDGSEAADRR LARVLWNDPA SGVMRHADAG
YPDAVACARE HGLTLPSLG
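
The protein sequence can be checked against the gene sequence by translating codons. The sketch below builds the codon table from the standard genetic code; translation table 11 uses the same codon-to-amino-acid assignments and differs in which codons may act as alternative starts, which this sketch ignores (a non-ATG start codon would be translated literally). The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAGCCGCC TCGACGATCC CCGCCTCGAC"))  # MSRLDDPRLD
```

The output matches the first block of the protein sequence above.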