Gene Information |
Locus tag | Haur_3664 |
Symbol | |
ID | 5735525 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4606508 |
End bp | 4608013 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280813 |
Product | L-arabinose isomerase |
Protein accession | YP_001546428 |
Protein GI | 159900181 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2160] L-arabinose isomerase |
TIGRFAM ID | |
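
The length fields above follow from the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the record does not state either convention explicitly. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
length = gene_length_bp(4606508, 4608013)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.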
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTGACT TGAAACAATA CGAAGTTTGG TTTATTACGG GCAGCCAACA TTTGTATGGC CCCGAAACTT TAGAACAAGT TGCCAAACAC TCCCAAATTA TCGCCGCTGG GCTTGATCAA AGTAGCACAA TTCCGGTGCG GGTGGTGTTT AAGCCTGTGC TTAAAACGCC CGACGAAATT TATAATTTGG TGCTCGAAGC CAACGCCGCC AAAAACTGCA TTGGCTTGGT CGCGTGGATG CATACCTTCT CGCCAGCCAA AATGTGGATT GCCGGCCTAC GCAGCTTGCA AAAACCATTG GCTCACTTGC ACACCCAATT CAACAGCGAA ATTCCATGGT CGGAAATCGA CATGGATTTT ATGAATCTCA ACCAAGCCGC CCACGGCGAC CGCGAATTTG GCTTTATTGT CAGCCGCATG CGCTTGGAAC GCAAAGTGAT TGTGGGTCAT TGGCAAGATA GCGAAGTGCA TGATCGAATT GCTGCCTGGA CTCGCGCCGC CGCTGCTTGG CACGATGCTC AAGGCGCACG CTTCGCCCGC TTTGGCGACA ACATGCGTGA AGTTGCCGTA ACCGAAGGCG ATAAAGTTAA TGCTCAAATG CGGCTTGGCT ATAGCGTCAG CGGCTATGGC GTGGGCGATC TAGTGCGCTT TGTCAACGAA GTTAGCGATG CCGATATCGA CAGCACGTTG CAAGAATATG CCGAACAGTA CGAATTGGCA GCAGGTTTAC AAGCTGGCGG CGAGCAGCAT CACTCGCTAC GTGAAGCCGC CCGCATCGAG CTAGGCTTGC GCTATTTCCT TGAGCATGGC AATTTCAAAG GCTTTACAAC GACGTTTGAA GATTTACATG GCTTAGTGCA ATTGCCAGGT TTAGGCCCAC AACGCCTGAT GGATCGTGGC TATGGCTTTG CTGGCGAGGG CGATTGGAAA ACCGCTGCGC TGGTGCGGGC AATGAAGGTG ATGAGCGCTG GCCTCAACGG TGGCACTTCT TTCATGGAAG ATTACACCTA TCACTTTGGC AGCAACGGCA TGAAAGTGCT AGGCGCACAT ATGCTCGAAA TCTGTCCATC AATTGCTGCG ACTAAACCAC GGCTCGAAGT GCATCCCTTG GGCATTGGTG GCAAGGCTGA TCCGGTGCGT ATGGTATTTG ATGCCAAAAC TGGACCTGCC GTCAACGCCT CAATCGTGGA AATGGGCAAT CGTTTGCGCT TGATCAATAG CGTGGTTGAT GCAGTTGAAA CCGACCAACC CTTGCCCAAA TTACCAGTTG CCCGCGCCTT GTGGCTACCC CAGCCCGATC TCAAAACCGC TGCGGCAGCT TGGATCTACG CAGGCGGAGC GCATCACACG GGCTTTAGTT TTGATCTGAC CAGCGAACAT CTCGCCGACT TTGCTGAAAT TGCTGGCATG GAATATTTGC AAATTGATCG CAACACCAAC GTTCAGCAAT TCAAACAAGA ACTTCGTTGG AACGATCTGT ACTATCACTT GGCCAAAGGT TTGTAA
|
Protein sequence | MLDLKQYEVW FITGSQHLYG PETLEQVAKH SQIIAAGLDQ SSTIPVRVVF KPVLKTPDEI YNLVLEANAA KNCIGLVAWM HTFSPAKMWI AGLRSLQKPL AHLHTQFNSE IPWSEIDMDF MNLNQAAHGD REFGFIVSRM RLERKVIVGH WQDSEVHDRI AAWTRAAAAW HDAQGARFAR FGDNMREVAV TEGDKVNAQM RLGYSVSGYG VGDLVRFVNE VSDADIDSTL QEYAEQYELA AGLQAGGEQH HSLREAARIE LGLRYFLEHG NFKGFTTTFE DLHGLVQLPG LGPQRLMDRG YGFAGEGDWK TAALVRAMKV MSAGLNGGTS FMEDYTYHFG SNGMKVLGAH MLEICPSIAA TKPRLEVHPL GIGGKADPVR MVFDAKTGPA VNASIVEMGN RLRLINSVVD AVETDQPLPK LPVARALWLP QPDLKTAAAA WIYAGGAHHT GFSFDLTSEH LADFAEIAGM EYLQIDRNTN VQQFKQELRW NDLYYHLAKG L
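
The relationship between the two sequences above can be checked with a minimal translation sketch. It hand-codes the codon-to-residue mapping in NCBI order (bases T, C, A, G), which for translation table 11 matches the standard code for elongation codons; alternative start codons are not handled. The helper name `translate` is hypothetical, and only the first and last few codons of the gene sequence are used as input here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, trim_stop: bool = True) -> str:
    """Translate a DNA string in frame 1; spaces between 10-base blocks are ignored."""
    seq = dna.replace(" ", "").upper()
    # Any trailing partial codon is ignored.
    codons = (seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3))
    protein = "".join(CODON_TABLE[c] for c in codons)
    if trim_stop and protein.endswith("*"):
        protein = protein[:-1]
    return protein


# First 30 bases and last 9 bases of the gene sequence in the record.
print(translate("ATGCTTGACT TGAAACAATA CGAAGTTTGG"))
print(translate("GGT TTGTAA"))
```

Passing the full gene sequence from the record to the same function should reproduce the full protein sequence, provided the assumptions above hold.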