Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_3094 |
Symbol | |
ID | 8884293 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | + |
Start bp | 3265806 |
End bp | 3267347 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | histidine ammonia-lyase |
Protein accession | YP_003511858 |
Protein GI | 291300580 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGATC TTCTACTGAC CGGAGACGGC TACTCCATCG AGGACGTCCA CGCGGTGGCC CACCGCCGCG CCCGGGTACG CCCCGGCCCC GGACTGGCCG AGCGGATGGC CTCCGCCCGC GCCGTCGTCA CCGAGGCGGT GCGCCGCAAG GACGTCGTCT ACGGCGTCAC CACCGGCTTC GGGGCGCTGG CCGACACCAC GATCGGCGCC GACGACCTCG CCAAACTCCA GGTCGGGATC GTGCGCAGCC ACGCCGCCGC CGTGGGCACC CCGCTGTCGG ACCCCATCGT CCGCTCGCTG CTGCTGCTGC GGGCCCGCAC CCTGTCGGCC GGATACTCCG GCGTCCGCCC CGAACTGCCG CTGTTCTTCC TGACCCTGCT GGAGCACGAC CTGCTGCCGG TGATCCCGGA GAAGGGCTCG GTGGGCGCCT CGGGCGACCT GGCCCAGCTG GCGCACCTGG CGCTGCCGGT CATCGGCGAG GGACGACTGC GCGCCCCCGG CGACCCCGTC GCGGGCCGCC CCGCCGCCGA AGTGCTGGCC GAACACGACA TCACGCCGCT GGTCCTCGAC CCCAAGGAGG GCCTGTCGCT CATCAACGGC ACCGAGCCGA TGCAGGCGGT ACTGGCGCTG GCCATCATGG AGGCCGAGGC GCTGTGCAAG ATCGCCGACA TCGCCTGCGC CATGAGCGTG GAGGCCCTGT ACGGCACCGA CCGCGCCTAC GACCCCCGCG TCCAGGTCAT CCGGCCGCAC CCCGGCCAGC TGGACTCCGC GGCCAACCTG CGCGACCTGC TTGACGGCAG CCCGCTTCTG GCCAGCCACC GCGACAGCGA CCACGCCGTC CAGGACTCCT ACGCGCTGCG CTGCGCCCCG CAGGTCCACG GCTCCAGCCG CGACCTGATC ACGCATTGCC GCAACGTGCT CAGCATCGAA CTGAAGTCCA TCGTCGACAA CCCCGTGGTC GTGCGCGGCC CCGGCGGCGA CGGCTTCGAG GTGATGAGCA CCGGCAACTT CCACGGCCAG CCGCTGGCCT TCTCCGCCGA CGCCCTGGCC ATGGCCTCCG CCGAACTGGG CAGCATCGCC GAACGCCGGG TCTACCGGAT GCTCGACCCG GCCACCTCCC GGGGCCTGCC GCCGTTCCTG GCCCCCGACG CGGGCACCAA CTCCGGGTTC ATGCTGGCGC AGTACACCGC CGCCAGCCTC GTCAGCGAGA ACAAGGTGCT GTGCCATCCG GCCGGCGTGG ACTCCATCGT CACCTCCGGC AACCAGGAGG ACCACGTGTC GATGGGCTGG CACGCGGTCC GCAAGGCCCG CGAGGTCATC GACAACGTCC GCAGCGTGCT GGCCATCGAG CTGCTGTGCG CCGCCCAGGG ACTGGACCTG CGCGGCGACG TCGCGAAACC CAGCCCCGCC ACCGGCGCCG TGCTGGAGCG GGTCCGTCAG GACGTCGCCG CCATGCCCGT CGACCGGGAA CTGGCCCCCC AGATCGAGGC GGTGCGCGAC ATGCTCGACG ACCTCATCGA CGTGGCCGGT GACCTTCGGT AA
|
Protein sequence | MRDLLLTGDG YSIEDVHAVA HRRARVRPGP GLAERMASAR AVVTEAVRRK DVVYGVTTGF GALADTTIGA DDLAKLQVGI VRSHAAAVGT PLSDPIVRSL LLLRARTLSA GYSGVRPELP LFFLTLLEHD LLPVIPEKGS VGASGDLAQL AHLALPVIGE GRLRAPGDPV AGRPAAEVLA EHDITPLVLD PKEGLSLING TEPMQAVLAL AIMEAEALCK IADIACAMSV EALYGTDRAY DPRVQVIRPH PGQLDSAANL RDLLDGSPLL ASHRDSDHAV QDSYALRCAP QVHGSSRDLI THCRNVLSIE LKSIVDNPVV VRGPGGDGFE VMSTGNFHGQ PLAFSADALA MASAELGSIA ERRVYRMLDP ATSRGLPPFL APDAGTNSGF MLAQYTAASL VSENKVLCHP AGVDSIVTSG NQEDHVSMGW HAVRKAREVI DNVRSVLAIE LLCAAQGLDL RGDVAKPSPA TGAVLERVRQ DVAAMPVDRE LAPQIEAVRD MLDDLIDVAG DLR
|
| |