Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3005 |
Symbol | |
ID | 4078035 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 3173217 |
End bp | 3174740 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638008334 |
Product | histidine ammonia-lyase |
Protein accession | YP_614999 |
Protein GI | 99082845 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2986] Histidine ammonia-lyase |
TIGRFAM ID | [TIGR01225] histidine ammonia-lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.36207 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGAGA TGATCCCGGG CGCCGTGACG CTCGACACTT TGGAACGGAT CTGGCGTCAC GGCACGCCTG CACGTCTGGC CGATAGCGCG CGTGCAGGTG TCGAGGCCGC TGCCGCGATG GTGGCAGAGG CCGCAGCGGG TGAAGTGCCC GTTTATGGCA TCAACACTGG TTTTGGCAAA CTTGCGTCGA CCAAGATCGC GCCGGAGGAT ACTGCGACCC TGCAGCGCAA CCTGATCCTG AGCCATTCTT GCGGTGTGGG CGAGCCGCTG GCCGAAGACA AAACCCGGTT GATGATGGTG CTGAAGCTGT TGTCGCTGGG TCGGGGCGCC TCCGGGGTGC GCTGGGCTGT CATTGAGCAG ATCCAAGAGA TGCTGGCACG CGGGGTGACA CCTGTTGTGC CGTCGCAAGG GTCCGTTGGC GCCTCTGGTG ATCTTGCCCC GCTTGCCCAC ATGACTGCGG CCATGATCGG CGAAGGCGAG GCAACAATCG ACGGCGTGCG CCTGCCTGGT GCCGAAGCCT TGAGGCGTGC CGGGTTGGAG CCGATTGTGC TGGGCCCGAA AGAAGGGCTT GGCCTGATAA ATGGCACGCA GTTTTCCACC GCTTGCGCGC TCACCGGGTT GTTTGAGGCC CTGGAGATGG CGCGGGCCTC CATGGCGATC GCGTCTTTGA CAACCGACGC TATCATGGGC TCTACCGCGC CTTTGGTGGC GGATATTCAC AGCTTGCGCG GCCATGCTGG GCAGATGGAG GTCGCGGCAA CGATGCGCGA CATCATGGCG GGCTCGGAAA TTCGCGAGAG CCACCGTGAG GGCGACACCC GCGTGCAGGA TCCCTATTGC ATCCGCTGCC AGCCTCAGGT GGTGGGCGCC GCGCTTGATG TGCTGCGCAT GGCGGCGCGC ACGCTTGAAA TCGAGGCGAA CGCGGTCACC GACAATCCGT TGGTACTGGT GGAGGCGGGG CAGATCGTCT CCGGGGGCAA CTTCCATGCC GAATATGTGG GCTTTGCGGC AGATCAGATC GCGCTCGCCG TGGCTGAGAT CGGCGCGATT GCGCAGCGCC GGGTTGCGCT GATGGTGGAT CCTACCCTGA GCCACGACCT ACCACCGTTC CTGACGCCGA ACCCCGGCCT CAACTCGGGA TTCATGATTG CCGAAGTCAC GACTGCGGCG CTCATGAGCG AAAACAAACA TCTGGCCAAC CCCTGCGTTA CGGATTCCAC ACCGACCTCC GCCAACCAAG AGGACCACGT CTCTATGGCG GCGCACGGTG CGCTGCGGCT GGCGAAAATG AACGCAAACC TGTCGGTGAT CCTTGGGGTC GAGATGCTTT GCGCGGCGCA GGGGGTCGAG GCGCGCGCGC CGCTCAAGAC CTCTAGCCGC TTGCAGAACC TGCTCGACAT GCTGCGCGGC GAGATCCCGA GCCTTGGCGA GGACCGCTAT CTTGCGCCGG AAATCGAAAC CGCCAGCGCG ATGGTGCGGG CAGGCCGCGT GGCGCAGGCC GCAGGCGTGG AGGTCAGCAC ATGA
|
Protein sequence | MIEMIPGAVT LDTLERIWRH GTPARLADSA RAGVEAAAAM VAEAAAGEVP VYGINTGFGK LASTKIAPED TATLQRNLIL SHSCGVGEPL AEDKTRLMMV LKLLSLGRGA SGVRWAVIEQ IQEMLARGVT PVVPSQGSVG ASGDLAPLAH MTAAMIGEGE ATIDGVRLPG AEALRRAGLE PIVLGPKEGL GLINGTQFST ACALTGLFEA LEMARASMAI ASLTTDAIMG STAPLVADIH SLRGHAGQME VAATMRDIMA GSEIRESHRE GDTRVQDPYC IRCQPQVVGA ALDVLRMAAR TLEIEANAVT DNPLVLVEAG QIVSGGNFHA EYVGFAADQI ALAVAEIGAI AQRRVALMVD PTLSHDLPPF LTPNPGLNSG FMIAEVTTAA LMSENKHLAN PCVTDSTPTS ANQEDHVSMA AHGALRLAKM NANLSVILGV EMLCAAQGVE ARAPLKTSSR LQNLLDMLRG EIPSLGEDRY LAPEIETASA MVRAGRVAQA AGVEVST
|
| |