Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0004 |
Symbol | |
ID | 4710075 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 3787 |
End bp | 4653 |
Gene Length | 867 bp |
Protein Length | 288 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639854460 |
Product | aminotransferase, class IV |
Protein accession | YP_001001601 |
Protein GI | 121996814 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase |
TIGRFAM ID | [TIGR03461] aminodeoxychorismate lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000202983 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGCCGTA AGCAACTGGT CAACGGCCGC GCGGATACGG CCCTCGACGC CGAGGATCGC GGCCTCGCCT ATGGCGATGG ACTGTTCGAG ACCGTAGCGG TCAGCCGCGG TCGGCTATGC CTCTGGGACT ACCACATGGA TCGGCTCCTA GACGGCGCGC GCCGACTCGG GCTGCCCGAG CCGCCCTTGG CCACCCTGCG GGAAGAGGCC CGTTTCCTCA CCGAGAAGGT GGAGCGGGGC GTACTGAAGG TGGTCTACAC CCGCGGCAGC AGTGAGGGTC GTGGCTACCT GCCGCCTGCC AGGCCAATCC CCACGCGAAT CCTGACACTG CACAATACTC CGGCGATTCC GCCGGAGCGC TGGCAGGGGG TTGATGTCCG GCTCTGTCGG ACCCGCATCA GCACGCAACC CCGGCTGGCC GGCATCAAGC ATCTCAATCG CCTGGAGCAG GTGATGGCCC GATCCGAATG GCGGGATGCC GCCATCGCCG AGGGCTTGAT GCTCGACGCC GACGGGCTCA TCGTGGAAGG CACAGCGACC AACCTCTTCG GAATCCGCAA TCGGGTGCTC ATGACGCCCC CTCTCACACA TTCAGGCGTG GCCGGTGTGA TGCGGCGCTG GGTCCTGGAG TACGCCGAGA CGCTCGGGCT GCGGGTCGAG CAGCGTGGCT TCTACCCGGG CGAGGTGTCC GAGATGGACG AGCTTTTTCT GACCAACAGC CTGATCGGCC TCTGGCCCGT CCGTTCCGTG GCGGGTACGC AGATACCGGT CGGACCGGTG AGTCAGCGCT ATCTCCAGGC AATCGCCGAT CATGGGCTCA CCCCGTTGGT TGAGGAGCCG GCGATGCGCG GCGGGGCAGG GCGTTGA
|
Protein sequence | MSRKQLVNGR ADTALDAEDR GLAYGDGLFE TVAVSRGRLC LWDYHMDRLL DGARRLGLPE PPLATLREEA RFLTEKVERG VLKVVYTRGS SEGRGYLPPA RPIPTRILTL HNTPAIPPER WQGVDVRLCR TRISTQPRLA GIKHLNRLEQ VMARSEWRDA AIAEGLMLDA DGLIVEGTAT NLFGIRNRVL MTPPLTHSGV AGVMRRWVLE YAETLGLRVE QRGFYPGEVS EMDELFLTNS LIGLWPVRSV AGTQIPVGPV SQRYLQAIAD HGLTPLVEEP AMRGGAGR
|
| |