Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1083 |
Symbol | |
ID | 5732872 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1239451 |
End bp | 1240800 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641278221 |
Product | UspA domain-containing protein |
Protein accession | YP_001543859 |
Protein GI | 159897612 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0589] Universal stress protein UspA and related nucleotide-binding proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000280681 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTACT CAATCGTGAT CATTGACCCA GATCAAGGTT CAGCCCAAAC AACTGCCGCG CGATTTCAGC GGCGTTGGGG CCAGAACGTC CAAGTCAATA TTGTTAAACA AGCTGATTGG GCTAGTGTTC AGGCTCATAG CCCCAACCTA GTCGTGATTG ACCCCGCGCC ATATCGGCTG CATGGGTTGC GCTTACTCGA ACAACTTTGC GAAGAGCAAC CATCAACAGC CATTGCCGTG GTCGCTTCGG GTAGCTCCCC AAGCATGCGG CAACGCCTAC GCAACTTGCC AATCACTTCA TATTTGGAAA AACCTAGCTC GTTAGCCCCC CTACTCGGTG AACTTGATCA CCTTGTTGAG AGCAATGTCG CTCTCCAATC CAAAGGAGGA TTGGTTATGG AACGCCAAAT GTTGATTCCA CTTGATGGTT CACTGTTAGC GGAACAAGCT TTGGATTATG CGGTGGTTTT AGCTCGCCGC AACAGCAGTG TTTTGCATTT AGTTCGGGTG ATTGGCTATC CACCGCTGGT TCCGGCTTAC GAATGGCCAG TGCCAAGCGC CGTTGATAGC CGTCAATGGT TAGCCGACGA ACGTCAAGCC GCCCAAACCT ACCTTGATGA ACTTAAAGCT CGTTATGAGC AACAAGGCTT AGCGGTACGA ACAACCGTGC TTGATGGCGA GCCAGCTCAT GCAATTGTCC AATTTGCGAC CGAACAAAAT AGCGTGCGCG AAATTGTGCT CGCCAGTCAT GGTCGCAGTG GGCTTGGGCG TTGGGTACTC GGTAGCATCG CGGAAAAATT AGTCCAAGCT ACGCCAGTGC CAATTTTGGT CATCCATGGC GATGAACGCA AAGTCGAAAC CTTTAAAATT CCGCCAGAAT TGCGCACGAT TGTTGTGCCA CTCGACGGCT CAGCCATTGC TGAACAAGCT TTGCCCTTGG CCAGCCAACT AGCCGAGGCC CATAGTGCTG AACTAGTCTT GCTGAGTGTC ACCCCGGGCA TCGACGATCC TGGTTTGATC GAATCGGGAC TTGTGCCAAT GTGGAGTGCT GGCGAAAAAG CTCAAGCCCG TGATCAAGCC CAGAAATATT TGCAACAGCT TGAACAATCG TTGCAAACGC CACGGCTACG TCTCCGCCAT CTCGTCGTGA GCGGCACACC CGCCGAAATG ATCGACGAAA TTGCCCAAGA AGCAGTTGCC AGCATGATCG TTATGGCAAC CCATGGACGC AGTGGCTTCA GTCGCATGTG GATGGGCAGC GTAGCAACCA AGCTCATTCG CAGCAGCCAA CGGCCAATCT TCTTGGTACG GGCGGTCGAA CAAGCTGCCG AACGCGGCCA ACCACTCTAA
|
Protein sequence | MSYSIVIIDP DQGSAQTTAA RFQRRWGQNV QVNIVKQADW ASVQAHSPNL VVIDPAPYRL HGLRLLEQLC EEQPSTAIAV VASGSSPSMR QRLRNLPITS YLEKPSSLAP LLGELDHLVE SNVALQSKGG LVMERQMLIP LDGSLLAEQA LDYAVVLARR NSSVLHLVRV IGYPPLVPAY EWPVPSAVDS RQWLADERQA AQTYLDELKA RYEQQGLAVR TTVLDGEPAH AIVQFATEQN SVREIVLASH GRSGLGRWVL GSIAEKLVQA TPVPILVIHG DERKVETFKI PPELRTIVVP LDGSAIAEQA LPLASQLAEA HSAELVLLSV TPGIDDPGLI ESGLVPMWSA GEKAQARDQA QKYLQQLEQS LQTPRLRLRH LVVSGTPAEM IDEIAQEAVA SMIVMATHGR SGFSRMWMGS VATKLIRSSQ RPIFLVRAVE QAAERGQPL
|
| |