Gene Mmar10_1601 details


Gene Information

Locus tag: Mmar10_1601
Symbol:
ID: 4283923
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Maricaulis maris MCS10
Kingdom: Bacteria
Replicon accession: NC_008347
Strand:
Start bp: 1753227
End bp: 1754894
Gene Length: 1668 bp
Protein Length: 555 aa
Translation table: 11
GC content: 65%
IMG OID: 638141088
Product: urocanate hydratase
Protein accession: YP_756831
Protein GI: 114570151
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2987] Urocanate hydratase
TIGRFAM ID: [TIGR01228] urocanate hydratase
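
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS length includes the stop codon (both assumptions are consistent with the values listed here, but are not stated by the record). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1753227, 1754894)
print(length)                     # 1668
print(protein_length_aa(length))  # 555
```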


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.308587
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCAGA CCCGCCGCGA CAATTCCCGC ATCATCCGCG CCAAGCATGG CTCAGAGCTC 
GACGCTACCC ATTGGGCCGC CGAGGCGCCC TTGCGCATGC TGATGAATAA TCTCGACCCC
GACGTGGCGG AGAAGCCGGA AGAGCTGGTT GTCTATGGCG GGATCGGCCG GGCCGCGCGC
GACTGGGAGA GCTATGACCG CATCGTCGCG ACGCTGAAAC GCCTGAAGGA AGACGAGACC
CTGCTGGTCC AGTCCGGCAA GCCGGTCGGG GTCTTCCGCA CTCACAAGGA TGCCCCGCGC
GTGCTGATCG CCAATTCCAA TCTCGTGCCC AACTGGGCCA ATTGGGATCA TTTCCGCGAG
CTCGATAAGA AGGGCCTGAT GATGTACGGC CAGATGACGG CCGGCTCCTG GATCTATATC
GGCTCGCAAG GCATCGTTCA GGGCACGTAT GAGACTTTCG TTGAGGCCGG TCGCCAGCAT
TATGACGGTG ACCTCACCGG CAAGTGGATC CTGACCGGCG GGCTTGGCGG CATGGGCGGC
GCCCAGCCGC TGGCTGCGAC GATGGCCGGC GCCTCCATGC TGGCGGTGGA ATGCCAGCCG
AGCCGGATCG AGATGCGGCT GAAGACCGGC TATCTCGACA AGAGCGCCAC CACGCTGGAC
GAGGCGCTGG AAATCATCAA CGCGGCCTGC GCCAAGGGCG AGGCGGTCTC GGTCGGCCTG
CTGGGCAATG CGGCCGAGGT CTTCCCGGAA CTGGTGAAAC GCGGCGTCAA GCCGGACATG
GTCACCGACC AGACCTCCGC CCATGACCCC GCAAACGGCT ATCTGCCCGC TGGCTGGACC
CTCGCCGAAT GGGACGAAAA ACGCGAGAGC GATCCCGCCG CCGTCGAAGC GGCCGCAAAA
GCCTCCATGG CGGAGCAGGT CAAAGCCATG CTGGCCTTTT GGGAGCAGGG CATCCCGACG
CTCGATTATG GAAACAATAT CCGCCAGATG GCCTTTGACG AGGGCGTTAC CAACGCCTTC
GATTTTCCCG GCTTCGTGCC GGCCTATATC CGCCCGCTGT TTTGCCGCGG CATTGGCCCT
TTCCGCTGGG CGGCGCTGTC GGGTGATCCC GAAGACATCT ACAAGACCGA CGCCAAGGTG
AAGGAACTGA TCCCGGACAA TCCGCACCTT CACCGCTGGC TCGACATGGC GCGCGAGCGC
ATCCACTTCC AGGGCCTGCC GGCGCGGATC TGCTGGGTCG GCCTGGGCGA GCGCCACAAG
CTGGGCCTCG CCTTCAACGA GATGGTCCGC ACGGGCGAGC TCTCCGCCCC CGTCGTAATC
GGCCGCGACC ATCTCGACTC AGGCTCGGTC GCCAGCCCGA ACCGAGAAAC CGAAGCGATG
ATGGACGGCT CAGACGCCGT CGCCGACTGG CCGCTCCTCA ACGCGCTCCT CAACACGGCT
TCGGGCGCCA CCTGGGTCTC GTTGCATCAT GGTGGCGGCG TCGGCATGGG CTATTCCCTG
CACTCCGGCC AGGTCGTCCT CGCCGACGGC ACGGTTGAGG CCGCAGAGCG CGTTGGTCGG
GTGTTGTGGA ATGATCCGGG CACGGGCGTG ATGCGTCACG CCGATGCCGG CTATGAGATC
GCGAAGGACT GCGCGAAGGA GCAGGGGCTG GATCTGCCGA GTGTGTAG
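
The GC content field can be checked against the gene sequence above. A minimal sketch, assuming the sequence is pasted as text in the same 10-base blocks used here; the function name gc_content is hypothetical, and the short string in the usage lines is a toy input, not the gene.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Toy input: 6 of the 8 bases are G or C.
print(gc_content("ATGC GGCC"))  # 0.75
```

Passing the full gene sequence and multiplying by 100 gives a percentage that can be compared with the value listed under Gene Information.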
 
Protein sequence
MTQTRRDNSR IIRAKHGSEL DATHWAAEAP LRMLMNNLDP DVAEKPEELV VYGGIGRAAR 
DWESYDRIVA TLKRLKEDET LLVQSGKPVG VFRTHKDAPR VLIANSNLVP NWANWDHFRE
LDKKGLMMYG QMTAGSWIYI GSQGIVQGTY ETFVEAGRQH YDGDLTGKWI LTGGLGGMGG
AQPLAATMAG ASMLAVECQP SRIEMRLKTG YLDKSATTLD EALEIINAAC AKGEAVSVGL
LGNAAEVFPE LVKRGVKPDM VTDQTSAHDP ANGYLPAGWT LAEWDEKRES DPAAVEAAAK
ASMAEQVKAM LAFWEQGIPT LDYGNNIRQM AFDEGVTNAF DFPGFVPAYI RPLFCRGIGP
FRWAALSGDP EDIYKTDAKV KELIPDNPHL HRWLDMARER IHFQGLPARI CWVGLGERHK
LGLAFNEMVR TGELSAPVVI GRDHLDSGSV ASPNRETEAM MDGSDAVADW PLLNALLNTA
SGATWVSLHH GGGVGMGYSL HSGQVVLADG TVEAAERVGR VLWNDPGTGV MRHADAGYEI
AKDCAKEQGL DLPSV
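
The protein sequence can be reproduced from the gene sequence by codon-wise translation. A minimal sketch, assuming the gene sequence is read in frame from its first base; it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are permitted). The names CODON_TABLE and translate are hypothetical.

```python
from itertools import product

# Codons in TCAG order, paired with the standard amino-acid assignments,
# which translation table 11 shares. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 21 bases and last 18 bases of the gene sequence above.
print(translate("ATGACCCAGA CCCGCCGCGA C"))  # MTQTRRD
print(translate("GATCTGCCGA GTGTGTAG"))      # DLPSV
```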