Gene Strop_1106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_1106 
Symbol 
ID5057553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp1250486 
End bp1252024 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content73% 
IMG OID640473373 
Producthistidine ammonia-lyase 
Protein accessionYP_001157955 
Protein GI145593658 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2986] Histidine ammonia-lyase 
TIGRFAM ID[TIGR01225] histidine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.825697 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGACCG TAGTCATTCA GCCAACCGGG GTCACCCCCG CCGACGTGCT CGCCGTCGCC 
CGCGGCAACG CCAAGGTCGT CCTCGACCCG GCAGCGATCG ACGCGATGAC CGCCAGCCGG
TCCGTCGTGG ACGGTATCGA GGCCGCCGGC CAGCCGGTGT ATGGCGTGAG CACCGGCTTC
GGAGCCCTCG CCAACACCTT CGTCGCCCCA CAACGGCGGG CGGAGCTACA GCACGCACTG
ATCCGTTCGC ACGCCGCCGG GGTGGGAACC GCCATGCCGC GCGAGGTGGT GCGGGCGATG
ATGCTGCTGC GTGTCCGTTC CCTCGCATTC GGCCGCTCCG GGGTCCGGCC GTTGGTTGCC
AACGCGCTGG TGGACCTGCT CAACCACGAT GTCACCCCGT GGGTGCCCGA GCACGGGTCG
CTGGGGGCCT CTGGTGACCT GGCGCCGCTG GCGCACTGCG CGCTGGCGCT ACTCGGCGAG
GGCTGGGTGC TGGGCGCGGC CGGCGACCGG ATTCCGGCCA GCGAGGCGCT ACGCCGGGCC
GGTCTCCCGC CGATCGAGCT GGCCGCCAAG GAAGGGCTGG CACTGATCAA CGGCACCGAC
GGAATGCTCG GCATGCTGCT GATGGCAAGC GACGACGCCG CACACCTGTT TACTCTGGCC
GATGTGACGG CCGCCCTGGC CGTCGAGGCG ATGCTCGGCT CGGACCGGCC GTTCCGGCCC
GAGTTGCACA CGATCCGGCC GCACCCCGGT CAGGCCGCCT CGGCGGCCAA CATCCACCGT
CTGCTCCAGG ACTCGGCGGT GATGGAGTCG CACCGCGACG ACGTGGTGCA CGCGGTGCAG
GACGCGTACT CGATGCGGTG CGCGCCGCAG GTCGCCGGCG CGGCCCGGGA CACCCTGGAC
TTCGCCCGAC AGGTGGCAGG CCGGGAACTG ATCTCGGTGG TGGACAATCC GGTGGTCCTG
CTGGACGGCC GGGTCGAGTC GACCGGGAAC TTCCACGGCG CCCCGCTCGG CTTCGCCGCG
GACTTCCTCG CCGTCGCCGC CGCCGAGGTC GGCGCGATCG CCGAGCGGCG GGTGGACCGC
CTGCTCGACG TGACCCGCTC CCGGGACCTA CCGGCCTTCC TCTCCCCCGA CGCCGGGGTC
AACTCCGGAC TGATGATCGC CCAGTACACG GCGGCCGGCA TCGTCGCGGA GAACCGCCGG
CTCGCCGCCC CCGCCTCGGT GGACTCCCTG CCCACCAGCG GCATGCAGGA AGACCACGTG
TCGATGGGCT GGGCGGCGAC CAAGAAGCTA CGGACCGTCC TGGACAACCT AACCAGTCTG
CTCGCGGTCG AGCTGCTCGC CGCGGTCCGC GGGCTCCAGC TGCGGGCGCC GCTACAACCG
TCGCCGGCCG GACGCGCCGC CATCGCCGCG TTGACCGGGG CCGCCGGGGA GCCCGGCCCG
GACATCTTCC TTGCTCCGGT GCTGGAGGCC GCCCGTGAGG TGGTTGCCGG CCCGGAGCTT
CGGGCCGCGA TCGAACGCGA GGTCGGAACG CTGGCCTGA
 
Protein sequence
MSTVVIQPTG VTPADVLAVA RGNAKVVLDP AAIDAMTASR SVVDGIEAAG QPVYGVSTGF 
GALANTFVAP QRRAELQHAL IRSHAAGVGT AMPREVVRAM MLLRVRSLAF GRSGVRPLVA
NALVDLLNHD VTPWVPEHGS LGASGDLAPL AHCALALLGE GWVLGAAGDR IPASEALRRA
GLPPIELAAK EGLALINGTD GMLGMLLMAS DDAAHLFTLA DVTAALAVEA MLGSDRPFRP
ELHTIRPHPG QAASAANIHR LLQDSAVMES HRDDVVHAVQ DAYSMRCAPQ VAGAARDTLD
FARQVAGREL ISVVDNPVVL LDGRVESTGN FHGAPLGFAA DFLAVAAAEV GAIAERRVDR
LLDVTRSRDL PAFLSPDAGV NSGLMIAQYT AAGIVAENRR LAAPASVDSL PTSGMQEDHV
SMGWAATKKL RTVLDNLTSL LAVELLAAVR GLQLRAPLQP SPAGRAAIAA LTGAAGEPGP
DIFLAPVLEA AREVVAGPEL RAAIEREVGT LA