Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_1579 |
Symbol | |
ID | 4896028 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 1659865 |
End bp | 1661388 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640112170 |
Product | histidine ammonia-lyase |
Protein accession | YP_001043461 |
Protein GI | 126462347 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2986] Histidine ammonia-lyase |
TIGRFAM ID | [TIGR01225] histidine ammonia-lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.179802 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0722817 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAATCC TCGTCCCCGG CCGGGCCACG CTCGCGCAGC TCGAAGCGAT CTGGCGCGAG GGGCGGCCCG CGCGTCTGGC CCCCGAGGCG CGCCCCGCCG TCGAGGCGGC GGCCGCCCGC GTGGCCGAGG CCGCGGCCGG CACGGCGCCG GTCTATGGCG TGAACACGGG CTTCGGCAAG CTCGCGAGCC TCAAGATCGC TCCGGCCGAT ACGGCGCAAC TGCAGCGCAA CCTGATCCTG TCGCACTGCT GCGGGGTGGG CGAGCCTATG CCCCCGTCCA CGGCGCGGCT GATGATGGCG CTGAAGCTCC TGTCGCTCGG CCGCGGCGCC TCGGGCGTGC GCTGGGAGAT CGTGGCGCTA CTCGAAGGCA TGCTGGCCGC GGGCGTCACG CCGGTGATCC CGGCGCAGGG GTCGGTCGGC GCGAGCGGCG ATCTGGCGCC CCTCGCCCAC ATGGCCGCAG TCATGATCGG CGAGGGCGAG GCCGAGGTCG GCGGCAGGCG CCTGCCCGGT GCCGCGGCGC TGGCCGAGGC CGGTCTTGCC CCGGTGGCCC TCGGACCCAA GGAAGGGCTC GCCCTCATCA ACGGCACGCA ATTCTCGACC GCCTATGCCC TCGCCGGCCT CTTCGAGGGC TGGCGCGCGG CTCAGGCGGC CCTGGTGATC TCGGCGCTCT CCACCGATGC GATCATGGGT TCGACCGCGC CGCTCCGCCC CGAGATCCAT GCGCTGCGCG GCCATGCGGG CCAGATCGAG GCGGCCGCCA CCATGCGCGC CCTGCTCGAA GGCTCGGCCA TCCGCGAGAG CCACCGTGAG GGCGACCAGC GGGTGCAGGA CCCCTACTGC ATCCGCTGCC AGCCGCAGGT GACGGGCGCC GCGATGGATG TGCTGCGCAT GGCGGCGGGC ACGCTGGCCA CCGAGGCCAA TGCCGCCACC GACAATCCGC TTGTGCTCTC GGACGGGCGC ATCGTCTCGG GAGGCAACTT CCATGCGGAG CCCGTGGGCT TCGCCGCCGA CATGATCGCG CTGGCGCTCT CCGAGATCGG CGCCATCGCG CAGCGCCGCG TGGCGCTGAT GGTGGATCCG ACGCTCTCCT TCGACCTTCC GCCCTTCCTC ACCCCCGAGC CGGGGCTGAA TTCCGGGCTG ATGATCGCCG AAGTGACGAC GGCCGCGCTC ATGAGCGAGA ACAAGCACAT GGCCGCCCCC ACCGTCACCG ACAGCACGCC CACCTCCGCC AATCAGGAAG ATCATGTCAG CATGGCGGCC CATGGCGCGC GCAGGCTCGG CCGGATGGTC GAGAACCTCG CGGTGATCCT CGGGACCGAG GCGATCTGCG CCGCGCAAGG GGTGGAGTTC CGCGCGCCCC TCGCCACCTC CGCCCCGCTC GGCGCCGTGC TGGCGCGGTT GCGCGCCGAG GTGCCGCGGC TCGGGGCCGA CCGCATCCTC GCCCCCGACC TCGCCGCGGC CGCCCGCCTC GTGCGCACAG GCGCGCTGGC CCGGGCCGCG GGCCTTCCCC TTCCCGCCCT CTGA
|
Protein sequence | MEILVPGRAT LAQLEAIWRE GRPARLAPEA RPAVEAAAAR VAEAAAGTAP VYGVNTGFGK LASLKIAPAD TAQLQRNLIL SHCCGVGEPM PPSTARLMMA LKLLSLGRGA SGVRWEIVAL LEGMLAAGVT PVIPAQGSVG ASGDLAPLAH MAAVMIGEGE AEVGGRRLPG AAALAEAGLA PVALGPKEGL ALINGTQFST AYALAGLFEG WRAAQAALVI SALSTDAIMG STAPLRPEIH ALRGHAGQIE AAATMRALLE GSAIRESHRE GDQRVQDPYC IRCQPQVTGA AMDVLRMAAG TLATEANAAT DNPLVLSDGR IVSGGNFHAE PVGFAADMIA LALSEIGAIA QRRVALMVDP TLSFDLPPFL TPEPGLNSGL MIAEVTTAAL MSENKHMAAP TVTDSTPTSA NQEDHVSMAA HGARRLGRMV ENLAVILGTE AICAAQGVEF RAPLATSAPL GAVLARLRAE VPRLGADRIL APDLAAAARL VRTGALARAA GLPLPAL
|
| |