Gene Mmar10_1603 details

Gene Information

Locus tag: Mmar10_1603
Symbol:
ID: 4283925
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Maricaulis maris MCS10
Kingdom: Bacteria
Replicon accession: NC_008347
Strand:
Start bp: 1755707
End bp: 1757230
Gene Length: 1524 bp
Protein Length: 507 aa
Translation table: 11
GC content: 65%
IMG OID: 638141090
Product: histidine ammonia-lyase
Protein accession: YP_756833
Protein GI: 114570153
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2986] Histidine ammonia-lyase
TIGRFAM ID: [TIGR01225] histidine ammonia-lyase
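
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1  # inclusive coordinates
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    protein_length = codons - 1  # the stop codon encodes no residue
    return gene_length, protein_length


# Coordinates taken from this record.
gene_bp, protein_aa = cds_lengths(1755707, 1757230)
print(gene_bp, protein_aa)  # 1524 507
```

For this record the result is 1524 bp and 507 aa, matching the Gene Length and Protein Length fields.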


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.329412
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value: 0.712329
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGACTG TGACCATCAA ACCCGGCGCC ATGGTGCTGG CCGACTGGGC GGACATCTGG 
CGAGGCGCGA CAATCCGGAT CGACCCGGAG GCCAAGGCCG GTGTCGATGC TGCGGCTGCC
ACGGTCGACC GCCTGATTGC TTCGGGCGAG GCGGTTTATG GCCTCAACAC CGGTTTCGGC
AAGCTGGCCC AGACCCGTAT CGCCGATGAC GAGCTGGCGA CCCTGCAGGA ACGGCTCGTC
CTGTCCCATG CCGCCGGGAT TGGCGAAGCA TTGGACAGCC GCATCGTCCG CCTGGTCATG
GCTTTGAAGC TGGCCAGCCT GGGGCGCGGC GCATCGGGCG TGCGCTGGAC AACGATTGCA
GCCATGCAAG CGCTGCTTGA TGCCGACGTG CTGCCGGTCA TCCCGTCGCA GGGCTCGGTT
GGTGCCTCCG GTGATCTGGC ACCATTGGCG CACATGTCCG CCGCCCTGAT CGGGGCAGGT
GAAGCGACGT GGCAGGGGCA GCGCATGCCG GCCTCCGACG CGCTTGCGAA AGCCGGTCTG
AAGCCGGTTC AACTCGGCCC GAAAGAGGGT CTTGCCCTGC TCAACGGAAC CCAGACCTCG
ACCGCGCTGG CTCTTGCCGG CCTGTTCGAG GTCGAGGCCG GCTTTCAGGC TGCCCTTGTA
TCCGGAGCGC TCAGTGTCGA CGCCGCCAAG GGCTCGGTCG CGCCCTTTGA TCCGCGCATT
CACAGCCTGC GCGGCCATCC CGGCCAGATC GATGTCGCGG CGGCGCTGCG GGGCCTGCTC
GATGGCTCCG GCATCCTGTC CAGCCATGAA GGCTGCGAGA AAATCCAGGA CCCGTACTGC
CTGCGTTGCC AGCCCCAGGT CATGGGTGCC GTGCTCGATC TCCTGCGCCA GGCCGGCGCC
GTGCTGGAGC GCGAAGCCAA TGCGGTCACC GACAATCCGC TGATCTTCAC GGACACGGAC
GAGGCCATTT CCGGCGGCAA TTTCCACGCC GAACCGGTCG CCTTCGCCGC CGACCAGATC
GCCATGGCAG CCTGCGAAAT CGGCTCGATC TGCGAGCGCC GCATCGCGCT TTTGACCGAC
CCGGCGGTGT CCGGTCTTCC GGCTTTCCTG ACACCCAATC CGGGCATCAA TTCCGGCTTC
ATGATCGCCC ATGTCACGGC GGCGGCTCTG GTCTCGGAGA ACAAGCAGAA AGCCTATCCG
GCCTCGGTCG ACAGCATTCC CACCTCGGCC AACCAGGAAG ATCATGTCTC CATGGCCACC
CATGGCGCCT TCCGCCTGCT GAAAATGGCG GAAAACCTCC ATGTCGTGGT CGGCATCGAA
TTGTTGTGCG GGGCGCAAGG GACGGATTTC CATGCCGGGC TGACTTCCTC GCCGACGCTG
GAAACGGCCA AGGCGACATT GCGCAAACAG GTTCCCGCCT ATGGCGATGA TCGCTATTTT
GCCACCGATA TCGCCAATGC GCGTGATCTG GTGACGGGCA GGGGGCTTGT GGCGGATGCG
GGCACGCTTC CCGGGATAGC CTGA
 
Protein sequence
MTTVTIKPGA MVLADWADIW RGATIRIDPE AKAGVDAAAA TVDRLIASGE AVYGLNTGFG 
KLAQTRIADD ELATLQERLV LSHAAGIGEA LDSRIVRLVM ALKLASLGRG ASGVRWTTIA
AMQALLDADV LPVIPSQGSV GASGDLAPLA HMSAALIGAG EATWQGQRMP ASDALAKAGL
KPVQLGPKEG LALLNGTQTS TALALAGLFE VEAGFQAALV SGALSVDAAK GSVAPFDPRI
HSLRGHPGQI DVAAALRGLL DGSGILSSHE GCEKIQDPYC LRCQPQVMGA VLDLLRQAGA
VLEREANAVT DNPLIFTDTD EAISGGNFHA EPVAFAADQI AMAACEIGSI CERRIALLTD
PAVSGLPAFL TPNPGINSGF MIAHVTAAAL VSENKQKAYP ASVDSIPTSA NQEDHVSMAT
HGAFRLLKMA ENLHVVVGIE LLCGAQGTDF HAGLTSSPTL ETAKATLRKQ VPAYGDDRYF
ATDIANARDL VTGRGLVADA GTLPGIA
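
The protein sequence can be reproduced from the gene sequence with translation table 11, in which an alternative start codon such as GTG is read as methionine at the first position. The following is a minimal sketch, assuming the codon assignments and start codons listed for NCBI translation table 11. The names `translate` and `is_start` are hypothetical and chosen for illustration only.

```python
from itertools import product

# Codon table in the NCBI layout: bases ordered T, C, A, G,
# third position varying fastest. Table 11 uses these assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Start codons assumed for table 11; each is read as Met when initiating.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str, is_start: bool = True) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    If is_start is True, an initiating start codon is translated as 'M'.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and is_start and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
head = "GTGACGACTG TGACCATCAA ACCCGGCGCC ATGGTGCTGG CCGACTGGGC GGACATCTGG"
print(translate(head))  # MTTVTIKPGAMVLADWADIW

# Last line of the gene sequence (in frame, not an initiation site).
tail = "GGCACGCTTC CCGGGATAGC CTGA"
print(translate(tail, is_start=False))  # GTLPGIA
```

Both fragments agree with the corresponding ends of the protein sequence listed above: the GTG start codon yields the initial M, and the terminal TGA stop codon is not represented in the protein.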