Gene Arth_0375 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0375 
Symbol 
ID4447169 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp397652 
End bp399244 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content68% 
IMG OID639688171 
Producthistidine ammonia-lyase 
Protein accessionYP_829876 
Protein GI116668943 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2986] Histidine ammonia-lyase 
TIGRFAM ID[TIGR01225] histidine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCTCA CCACCCACGA ACCGCTGACC GTCACCCTCG GCTCCAGCGG CGTCACCCCC 
GAGGACGTCG TCGCGGTCGC CCGCCACGAC GCCAAGGTGA CCATCTCCCA GGAAGCCCTC
GACACGGTAG CCAAGGTCCG CGCCCACATC GACGACCTCG CCCACAGCGA GGTCCCGGCC
TACGGCATCT CCACCGGTTT CGGCGCGCTG GCCAACCGGC ACATCCCCAA CGAACTGCGC
ACCCAGCTGC AGAAGTCGCT GATCCGCAGC CACGCCGCCG GCATGGGGCC GGCCGTGGAA
CGCGAGGTGG TCCGGTGCAT CATGTTCCTG CGGGCCAAGA CCCTCGCCTC CGGCCGGACA
GGGGTCCGCC CCGTGGTGCT GCAGACCATG GTGGACGTGC TCAACGCCGG CATCACCCCG
GTGGTCCGCG AGTTCGGCTC GCTCGGCTGC TCAGGAGACC TCGCGCCGCT GTCCCACTGC
GCCCTGGTCC TCATGGGCGA AGGCGAAGCA GAAGGTCCGG ACGGCACGCT GTACGGGAAC
AAGGGTCAAA AGCCTGTCGC CGAACTACTG GCCGAGCACG GGATCGAACC CGTCGTCCTC
GCCGAGAAGG AAGGGCTGGC TCTGGTCAAC GGCACCGAAG GCATGCTGGG CATGCTGCTG
ATGGCCATCG CGGACATCCG CCAGCTGCTC ACGACGGCGG ACATCACCGC CGCGCTCAGC
GTCGAGGCGC TGCTCGGCAC CGACCAGGTG TTCCTGCCCG AACTGCACGC GGCGCTCCGC
CCGCACCCGG GCCAGGCTGC GTCCGCGGAC AACATGCTGC GTGTGCTTTC CAACTCGCCG
ATCGTGGCAT CGCACCGGAT CAACGACACC AAGGTCCAGG ATGCCTACTC GCTGCGCTGC
GCACCCCAAG TGGCCGGCGC CGTCCGCGAC ACCGTGGACC ATGCAGCACT GGTCGCCTCG
CGCGAACTCG CCGCCGCCAT CGACAATCCC GTTGTCCTGC CGGACGGCCG CGTGAGTTCC
AACGGCAATT TCCACGGCGC CCCTGTGGCC TACGTTTTGG ATTTCCTGGC AATTACCGTG
GCGGATTTGA GTTCCATTGC CGAACGCCGG ACCGACCGCA TGCTGGACCC GGCGCGTTCA
CACGGCTTGC CGGCCTTCCT GGCTGCGGAC CCCGGCGTCG ACTCCGGCCT GATGATCGCG
CAGTACACAC AGGCCGGGCT GGTCTCGGAC AACAAGCGGC TCGCGGTCCC GGCGTCGGTG
GACTCCATCC CCAGCTCGGC CATGCAGGAA GACCACGTGT CCATGGGCTG GCATGCTGCC
CGGAAACTCC GCAAAGCCGT GGAGAACCTG CGCCGCGTCC TGGCTATCGA GCTGGTCACT
TCGGCGCGGG CCATTGACAT GCGGACGCAG CTATCCGGGG GAAAACTGAC CCCGGGGCCG
GCCGGAACCG CGGTCATTGC GGCGCTCCGC AACGTGGTGG GAGGGCCGGG AACGGACAGG
TTCCTGTCGC CGGAACTGGA GGCCGCGGAC CGGCTCGTCG CCTCCGGCGA GGTACGCGCG
GCAGCCGAAT CCGCCGTCGG AATTCTGGCG TAA
 
Protein sequence
MTLTTHEPLT VTLGSSGVTP EDVVAVARHD AKVTISQEAL DTVAKVRAHI DDLAHSEVPA 
YGISTGFGAL ANRHIPNELR TQLQKSLIRS HAAGMGPAVE REVVRCIMFL RAKTLASGRT
GVRPVVLQTM VDVLNAGITP VVREFGSLGC SGDLAPLSHC ALVLMGEGEA EGPDGTLYGN
KGQKPVAELL AEHGIEPVVL AEKEGLALVN GTEGMLGMLL MAIADIRQLL TTADITAALS
VEALLGTDQV FLPELHAALR PHPGQAASAD NMLRVLSNSP IVASHRINDT KVQDAYSLRC
APQVAGAVRD TVDHAALVAS RELAAAIDNP VVLPDGRVSS NGNFHGAPVA YVLDFLAITV
ADLSSIAERR TDRMLDPARS HGLPAFLAAD PGVDSGLMIA QYTQAGLVSD NKRLAVPASV
DSIPSSAMQE DHVSMGWHAA RKLRKAVENL RRVLAIELVT SARAIDMRTQ LSGGKLTPGP
AGTAVIAALR NVVGGPGTDR FLSPELEAAD RLVASGEVRA AAESAVGILA