Gene Information |
Locus tag | Hhal_1121 |
Symbol | |
ID | 4710081 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 1217408 |
End bp | 1219195 |
Gene Length | 1788 bp |
Protein Length | 595 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639855593 |
Product | glycoside hydrolase 15-related |
Protein accession | YP_001002699 |
Protein GI | 121997912 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCACGC TCGACCAAGC CCTGATTGGC AACTGCGCGT TTGCCGCCCT GGTCAACCGC CAGGCCGAGA TCACTTGGGC GTGCATGCCC CGCTTCGATG GCGATCCGGT CTTCTGCTCC CTGCTCGGCG ATCCCGCGGC CGGTGCCGGC GCCGGGCGTT TCGCCGTCGA GCTGGAGGGG CTGGCCCGCA CCGAGCAGTG GTATGTTCGC AATACCGCCA TCGTGGTGAC GCGGCTCTAC GACCACCAGG GCGGGGTGGT GGAGGTGACC GATTTCGCCC CCCGCTTCGA GCAGTTCGGC CGCACCTTTC GCCCGGTGGA GCTGATCCGT CAGGTCCGTC GGGTTGCCGG CTCACCCCTG GTCTCTCTGG TGGTGCGGCC GGTGTTCGAT CACGGCCGGT CGTTGCCCAG CGTGACTCAG GGCAGCAACC ACGTGCGCTA CGTCGGCCCG GGCCAGGTCC TGCGCCTGAC CACCGACGCC TCGTTGACCG CGGTCCTGGA CGAGACCCCG TTCATCCTCG AGGACGACCT GACGCTGGCC TTCGGTCCCG ACGAGACCCT GCTCCAGTCG GCCCGTGAGA CCGGCTACCA GTTCTATGAG CACACCCTAA CCTATTGGCA GGAGTGGGTG CGCAACCTGG CCATCCCCTT TGAGTGGCAG GAGGCGGTGA TCCGCGCGGC GATCACCCTC AAGCTCAACA CCTACGAGGA CACCGGTGCG GTGATCGCCG CGGTGACCAC TTCGATCCCC GAGGCCCCGG ACACCGGCCG CAACTGGGAT TACCGTTACT GTTGGCTGCG GGACGCCTAC TTCGTGATCA ACGCCCTCAA CCGCCTGGGG GCGACCAAGA CGATGGAGCA CTACGTGCGC TTCATCATCA ACACCACTGC CCGCAACGAG GGGCGGTCGC TGCGGCCGGT GTATCGCATC AACGGGCGTG ATGACCTTTA CGAGACCATC GCCTACGGCT TGCCCGGATA CCGGGAGATG GGGCCGGTGC GCATCGGCAA CCAGGCCTAC GAGCAGCAGC AGAACGATGT TTACGGGTCA GTGATCCTGG CCACCGCGCA CTTGTTCTTC GACGAGCGGC TGCGCCGCCG CGGCGATGAA TCGCTGTTCC GCCGTCTCGA GCAGCTCGGT GAGCAGGCCG TCGCCGTCTA CCGCGAGCCG GATGCTGGCC CCTGGGAGTT CCGCGGCTTT GAAAAGGTGC ATACCTTCTC GGCGGCGATC TGTTGGGCGG CTGCCCGGAA CCTGCGTGCC ATCGCCGCCA AGCTGGGGTT GATGGAGCGG GCTGATTACT GGCGCCGCCG GGCCGACGAG ATGGCCGACA CCATCCGCAA CAGCGCCTGG AACGAGCAAC GCAACAGCTA CATGGCCAGT TTCGGCGGCC AGGACCTCGA CGCCAGCCTG ATGCTGCTCT ACGAGTGGGG GTTCCTGCGT GCCGGAGACC CGCGCCTGGC CGGGACCGTG CGCGCCGTCG AGCAGGAGCT GCGCCACGGC GACTTCCTCT TCCGTTACGT CCACGAGGAC GATTTCGGCA AGCCCCACAC GGCCTTCACC ACCTGTACCT TCTGGTACAT CGACGCCCTG GCGGCGGTGG GCCGTGAGGC CGAGGCCCGC GCACTGTTCG AGCGCCTGCT CGAATGTCGC AACCACGTGG GTCTGCTCTC GGAGGATATC GATCCGTACA CCGGAGAACT CTGGGGGAAC TTCCCGCAGA CCTACAGCAT GGTCGGTCTG ATCAACTCCG CAATGCGCCT GAGTCGCAGT TGGGAGGAAC CGCTGTGA
|
Protein sequence | MSTLDQALIG NCAFAALVNR QAEITWACMP RFDGDPVFCS LLGDPAAGAG AGRFAVELEG LARTEQWYVR NTAIVVTRLY DHQGGVVEVT DFAPRFEQFG RTFRPVELIR QVRRVAGSPL VSLVVRPVFD HGRSLPSVTQ GSNHVRYVGP GQVLRLTTDA SLTAVLDETP FILEDDLTLA FGPDETLLQS ARETGYQFYE HTLTYWQEWV RNLAIPFEWQ EAVIRAAITL KLNTYEDTGA VIAAVTTSIP EAPDTGRNWD YRYCWLRDAY FVINALNRLG ATKTMEHYVR FIINTTARNE GRSLRPVYRI NGRDDLYETI AYGLPGYREM GPVRIGNQAY EQQQNDVYGS VILATAHLFF DERLRRRGDE SLFRRLEQLG EQAVAVYREP DAGPWEFRGF EKVHTFSAAI CWAAARNLRA IAAKLGLMER ADYWRRRADE MADTIRNSAW NEQRNSYMAS FGGQDLDASL MLLYEWGFLR AGDPRLAGTV RAVEQELRHG DFLFRYVHED DFGKPHTAFT TCTFWYIDAL AAVGREAEAR ALFERLLECR NHVGLLSEDI DPYTGELWGN FPQTYSMVGL INSAMRLSRS WEEPL
|
| |
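The coordinate and length fields in the record above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1217408
END_BP = 1219195
GENE_LENGTH_BP = 1788
PROTEIN_LENGTH_AA = 595


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the values derived from the coordinates with the listed fields.
    print("span matches gene length:",
          span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print("codon count matches protein length:",
          expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the interval spans 1788 bp, and 596 codons minus the stop codon gives 595 amino acids.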