Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_6152 |
Symbol | |
ID | 8669451 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 6748579 |
End bp | 6750126 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | Histidine ammonia-lyase |
Protein accession | YP_003341625 |
Protein GI | 271967429 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00526934 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGCGACA ACGAGGTCGT GAACATCGGT CCGGAGCCGC TGAGCTTCGA CGAGGTCGTC AGGGTGGCCC GGCACGGCGC CGCGGTACGT CTCACCGACG ACGCCGTGGC GGCCATGGCC GCGGCCCGCA CCCGCGTGGA CGAGCTCGCC GAGAGCCCCA CCCCCGCCTA CGGGGTGTCG ACCGGGTTCG GGGCCCTGGC CACCCGGCAC ATCGACCCCT CACTCCGTGC CCAGTTGCAG CGCTCCCTGG TCCGATCGCA CGCGGCGGGC TCCGGTCCCG AGGTGGAGAT CGAGGTCACC CGGGCGCTGA TGCTGCTGCG CCTGCACACC CTGGCCACCG GGCACACCGG CGTGCGGCCG AAGACAGCCA AGGTGCTCCA GGACCTGCTC AACGCCCGCA TCACCCCCGT GGTGCACGAG TACGGCAGCC TCGGCTGCTC CGGCGACCTC GCGCCGCTCT CCCACGTCGC GCTCACGATC ATGGGCGAGG GCGTCGTCCG CGACGCGTCC GGAGAGCGTG TGGACGCCGC CGAGGCGCTG AAGCAGGCGG GCATCGAGCC CGTCGAGCTC GCCGCCAAGG AGGGCCTCGC GCTCATCAAC GGCACCGACG GCATGCTCGG CATGCTGATC CTGGCCATGG ACGACCTCGG CCGCCTGTTC AAGACCGCCG ACGTCAGCGC GGCCATGAGC GTGGAGGCGC TGCTCGGCAC CGACCGCGTC TTCGCCGCCG ACCTGCAGGC CCTGCGCCCG CACCCGGGAC AGGCGGCCAG CGCCGCCAAC CTCCGGGCCC TGCTGGCGGA CTCCGGGATC ATGGACTCGC ACCGGGACGG CACCTGCACC CGCGTCCAGG ACGCCTACTC CCTGCGCTGC GCCCCGCAGG TCGCCGGAGC CGCCCGCGAC ACCCTCGCGC ACGCCGCCGC GGTCGCCTCG CGGGAGCTCG CCTCCGCGAT CGACAACCCG GTGGTCCTCG CCGACGGCCG GGTGGAGTCC AACGGCAACT TCCACGGCGC CCCCGTGGGC TACGTGCTCG ACTTCCTCGC CATCGCGGTC GCCGACATGG CGAGCATCTC CGAGCGCCGT ACCGACCGCA TGCTCGACGT GGCCCGCAGC CACGGCCTGC CCGCCTTCCT CGCCGACGAC CCCGGCGTGG ACTCCGGGCA CATGATCGCC CAGTACACCC AGGCCGCGAT CGTCTCCGAG CTCAAGCGCC TGGCCGTGCC CGCCAGCGTG GACTCCATCC CGAGCTCCGC CATGCAGGAG GACCACGTCT CCATGGGCTG GTCGGCGGCC CGCAAGCTGC GCCGCTCGGT GGACGGCCTG ACCCGGGTGC TCGCGGTGGA GATCCTCACC GCGGCACGGG CTCTCGACCT GCGGGCGCCG CACCGGCCCG CCCCCGCGAC CGGGGCCGTG GTCGCCTCCC TGCGCCAGTC GGTCCCCGGC CCCGGTCCGG ACCGCTTCCT CGCCCCGGAG ATCGAGGCGG TGGTCCGGCT CGTCGCCGAC AACACCGTCG TCGTCGCGGC GGAGTCGGTC ACCGGCCCGC TGGGATAG
|
Protein sequence | MRDNEVVNIG PEPLSFDEVV RVARHGAAVR LTDDAVAAMA AARTRVDELA ESPTPAYGVS TGFGALATRH IDPSLRAQLQ RSLVRSHAAG SGPEVEIEVT RALMLLRLHT LATGHTGVRP KTAKVLQDLL NARITPVVHE YGSLGCSGDL APLSHVALTI MGEGVVRDAS GERVDAAEAL KQAGIEPVEL AAKEGLALIN GTDGMLGMLI LAMDDLGRLF KTADVSAAMS VEALLGTDRV FAADLQALRP HPGQAASAAN LRALLADSGI MDSHRDGTCT RVQDAYSLRC APQVAGAARD TLAHAAAVAS RELASAIDNP VVLADGRVES NGNFHGAPVG YVLDFLAIAV ADMASISERR TDRMLDVARS HGLPAFLADD PGVDSGHMIA QYTQAAIVSE LKRLAVPASV DSIPSSAMQE DHVSMGWSAA RKLRRSVDGL TRVLAVEILT AARALDLRAP HRPAPATGAV VASLRQSVPG PGPDRFLAPE IEAVVRLVAD NTVVVAAESV TGPLG
|
| |