Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1858 |
Symbol | |
ID | 4711261 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 2029491 |
End bp | 2030516 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 639856330 |
Product | aminotransferase, class I and II |
Protein accession | YP_001003424 |
Protein GI | 121998637 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01140] L-threonine-O-3-phosphate decarboxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGAAC AGCACCCCCT GCCCCGCATG GAGGAGCACG GCGGCAATCT TGATCAGGCA ACAGCCCGCT TCGGGCTGCC CCGGGAGGGC TGGGTGGACC TCTCGACCGG GATCAATCCG ACCCCCTTCC CGCTGACCCC TGCGCCCGAC GCGGCGTGGC ACCGGCTGCC AGAGGCCGAC GACCTGGAAG CGCGGGCGGC TGAACACTAT CGGGCCGGAA ACAACGCCGC CCTCGCCCTG CCCGGCTCCC AGGCAGCCAT CAGCCTGCTC CCGGCGCTCG AACCCCCGGG ATACGTAGCC ATCCCCGCCC CGGAGTACGC CGAACATGCG CGGGCCTGGC AGCGCTGGGG CCACCGGGTC GAGCGGCTCA CCGCCGACTG CATCGCCGCC GGACCGCCCC GGCGGCTGCC CTGGCAGACG ATGGTATTGA GCCACCCGAA CAACCCCACC GGAACCCGCC ATTCGGCTGC CACTCTACTG GCCTGGTGCG ATGCGCTGGC GGCCGAGGGC GGGCAGCTGA TTGTCGACGA GGCTTTCTGC GACGCCGAAC CGGAGACCTC CCTAGCGCCG TCCGCCGGGC GCCCGGGCCT GGTGCTCCTG CGCTCGCTGG GCAAGTTCTA TGGCCTGGCC GGCGCCCGGG TCGGATTCCT GCTGGGCCCG CAGGCGCTCC GCCAGCGGTT GGCCGACCTC CTCGGCCCGT GGCCGGTGGC GGGTCCGGCA CGCCACGCCG CCCGCCAGGC GCTGGCAGAC AGCGCCTGGC AAGACCGCCA GCGGCACGTC TTGGCGGCGT CGAGTGAACG GCTGGACCAC CTGCTGACCC GGGCCGGACT CGCCCCGACC GGCGGCACGG CGCTATTCCG CTGGACCCCC TGCCACGACG CCCGCCAACG CCAGGCGGAA CTGGCCCGTG CCGGCATTTG GGTACGCGCC TTCGATGCGC CAGCGGGGCT ACGCTTCGGC CTGCCGGGAC CGGAATCCGA CTGGCAGCGC CTGGCCGCGG CCCTGGGGTG CCCGCCAGGG GACTGA
|
Protein sequence | MAEQHPLPRM EEHGGNLDQA TARFGLPREG WVDLSTGINP TPFPLTPAPD AAWHRLPEAD DLEARAAEHY RAGNNAALAL PGSQAAISLL PALEPPGYVA IPAPEYAEHA RAWQRWGHRV ERLTADCIAA GPPRRLPWQT MVLSHPNNPT GTRHSAATLL AWCDALAAEG GQLIVDEAFC DAEPETSLAP SAGRPGLVLL RSLGKFYGLA GARVGFLLGP QALRQRLADL LGPWPVAGPA RHAARQALAD SAWQDRQRHV LAASSERLDH LLTRAGLAPT GGTALFRWTP CHDARQRQAE LARAGIWVRA FDAPAGLRFG LPGPESDWQR LAAALGCPPG D
|
| |