Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1115 |
Symbol | |
ID | 5733007 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1277182 |
End bp | 1278345 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278254 |
Product | hypothetical protein |
Protein accession | YP_001543891 |
Protein GI | 159897644 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1613] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.483933 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTAAGG CATGGGTGGT TGCAGCTGGC TTTTTTCTGA TTTGTGCGTT AGTCGTGGTT GCGGTGAGCC AAGGTTGGTT GGCTTCATCA GCGCCCTTGG CAATTTTGCC CACCCGGGTG CCTGGCACGC CCGACCCAAG CCAGGCAATT AAGTTGCAAT TGGTCTATAG CAGTGAAAAA GAGGTTTGGC TCAAGCAATC GGTCAATGCG TGGCAGGCAA CCAATCCGAC TGTCGATGGC AAGCCGATTC AAGTCGAATT GATTTCGCGT GGTTCGCAAG AGATTGTAAC GCAGTTGCAG GATGGCAGTT TGCGACCAAC CGCGATCAGC CCCGCTAGCA CCTTACAAAT TACCCAAATT AATGTCGCCC AGATCGATGC TAAAACCAAT TTGGCTGCTG ATGCCGAACC GATGTTGATT TCGCCATTGG TGCTAGCTGC CTACGAAGGT TCGCCCGCCG CCAGCTTGTT GAAATCGCAA GATCCCCAGC TTTGGCAAGC CTTGCACGAT GGGGTTCTCA TTCCCCAGCG CAGTCAAAAA ATTCTATTTG CCCAAACTAG CCCGACCACT TCAAATAGCG GTTTTCAGGC CTGGATTTTG ATGGCCTATG CTTACCATAA TAAAGCTAGT GGCTTAACCG CTGGAGATGT TAATGATCCT AAATTTGTGG AATGGCTCCA AGGCTACGCC AAAAATGTTG AGCAATTTGC TGAAAGCACT GGTTCGTTAA TGCAGAAAAT GGTGCAATTT GGGCCAAGTA GCTATGGCGC AGCCGTGGTC TATGAAAGCA CGGCGCTTGA GTATTTGCCC AAGGCCCAAG GCCGTTTTGG TAAATTAATC TTGGTCTATC CACCGCAAAA TCTTTATAGC GACCATCCTT TGGCCATTTT GGATGCTTCA TGGAGCAGCG ATGATCAACG GGAAGCAGCT GGATTATTGC GCGAATTTCT ACTAGCCAAG CCACAACAAA CCACGGCTGT CCAACAAGGC TTTCGCCCTG TCGATCCCGA TATTGCGATC GATGCCGCCG ATTCGCCTTT TGTTAAATAT CAAAGTTCCG GAGTGCAAAT GAGCATTCCA AATTTGGCGG CTGAGCCAGA TGCTGCCACA ATCAAGGCAG TGCTGGATTT GTGGAACTCG TTAGCGGCCA GTGTTAAACG CTAA
|
Protein sequence | MRKAWVVAAG FFLICALVVV AVSQGWLASS APLAILPTRV PGTPDPSQAI KLQLVYSSEK EVWLKQSVNA WQATNPTVDG KPIQVELISR GSQEIVTQLQ DGSLRPTAIS PASTLQITQI NVAQIDAKTN LAADAEPMLI SPLVLAAYEG SPAASLLKSQ DPQLWQALHD GVLIPQRSQK ILFAQTSPTT SNSGFQAWIL MAYAYHNKAS GLTAGDVNDP KFVEWLQGYA KNVEQFAEST GSLMQKMVQF GPSSYGAAVV YESTALEYLP KAQGRFGKLI LVYPPQNLYS DHPLAILDAS WSSDDQREAA GLLREFLLAK PQQTTAVQQG FRPVDPDIAI DAADSPFVKY QSSGVQMSIP NLAAEPDAAT IKAVLDLWNS LAASVKR
|
| |