Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0499 |
Symbol | |
ID | 5732413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 579618 |
End bp | 581315 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641277625 |
Product | single-stranded nucleic acid binding R3H domain-containing protein |
Protein accession | YP_001543278 |
Protein GI | 159897031 |
COG category | [S] Function unknown |
COG ID | [COG3854] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR02858] stage III sporulation protein AA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000861102 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAGATC GTCGCGAAAT TACCCATAAT ATCGATCTGT TGCTGTCAAC CTTGCCGCCA CGTTTGGCCG AGCCGTTGGC AACCCATGAA CAGAAAGATC AAGTTATAGA AATTGTCATG GATTTGGGGC GCTTGCCCGA GGCGCGTTTT CGCCACGACC AATCAAGTTT CCTTAGCGAA ACTGAGGTCT CGCGCGAAGA TCTCGATTAT GTGACCGAGC GGATTGGTCA ATTTGGTGAA GATAATCGCG CTGGGATTCA GCGCACGCTG CACCGCATTT CAGCCATTCG CAACCGCTCT GGGGTGGTGA TCGGTCTGAC CTGTCGGGTT GGCCGCGCCG TTTATGGCAC AATCGAAATC GTGCGCGATT TGGTCGAGGC TGGTAAATCG ATCTTGATTC TGGGCAAGCC TGGGACAGGT AAAACCACCA TGCTGCGCGA AGTCGCCCGC GTCTTGGCCG ACGACTTTCT CAAGCGGGTG GTGATCGTCG ATACCTCGAA CGAAATCGCT GGCGATGGCG ATATTCCTCA CCCAGGGATT GGTCGCGCTC GGCGAATGCA AGTAGCTCGG CCAGCCGAAC AGCACAATGT GATGATCGAA GCGGTCGAAA ACCACATGCC CCAAGTCATT GTGATCGATG AAATTGGGAC GGAGTTGGAA GCTCAAGCTG CTCGCACCAT CGCCGAACGT GGGGTGCAAT TGGTTGGTAC GGCTCACGGG AATACCTTGG AAAACCTGAT GCTCAACCCA ACTCTCTCCG ATCTGATCGG CGGGATTCAG GCAGTAACCT TGGGCGATGA AGAAGCTCGA CGGCGGGGAA CCCAAAAAAC CGTGCTCGAA CGCAAAGCGC CGCCAACCTT CGATGTGTTG GTCGAAATTC AAAGCTGGGA CGATGTAACG ATTTATCAAG AAGTCGCTTC AGCCGTCGAT TCGATTTTGC AGGGCAATGA GCCAACCGCC GAACAACGCA CCAAGGACGA GCAAGGCGAA ATTGTGGTGC ACGAAGGCCG CCCAGAACGG CTTGATAGCG AAGCTAGCAC CATGCTCACG CGGCGTGGCG GCTATCGCAG CCGTGAACGC GACCGTGATC GTGACCATGG CGAACGCGAC TGGCGGCGTA AAAGCGAACG GCGTGAAGCC CAACGCGAAG AGCGCGAACG CTATAGCTTG GCACTGAATG GCAGCACTAG CCCAAAAACC GAAGAACCAA GCGTGCTGGT TAAAAAGCCT GGTAAAAATG CGCCAGCCAA GATTTTTGCC TTTGGGGTTA GCCGCAATCG CTTGGAAAAA GCCTTAGATC GTTTGGGCAT TACTGCTAGT TTGGTGCGCG AAATGGAACA AGCCACGATG GTGATTACGC TGAAAAATTA CTATCGTCAG CACCCAACTC GGCTGCGTGA TGCCGAAGAA CGTGGAATTC CAGTGTATGT GCTACGCTCG AATACCCAAA CCCAAATGGA AGAATGCCTC GGCTCGGCCT TCGAAATCAG CATCAGTCCT AGCGACCCGC TGAGCGAGGC CATGGAAGAG GTTGAAGAAG CAATCAGTCA AGTCATGGAT GGCTCAACCG AAAGTATCGA ACTGAGTCCG CAAAGTTCCT ATGTGCGTAG GTTGCAACAT CAGATTGTTG AGCGGTACAA TCTACAATCA GAAAGTACTG GCAAAGAACC ACGTCGCCGC ATTCGGATCT TTCGCTAA
|
Protein sequence | MLDRREITHN IDLLLSTLPP RLAEPLATHE QKDQVIEIVM DLGRLPEARF RHDQSSFLSE TEVSREDLDY VTERIGQFGE DNRAGIQRTL HRISAIRNRS GVVIGLTCRV GRAVYGTIEI VRDLVEAGKS ILILGKPGTG KTTMLREVAR VLADDFLKRV VIVDTSNEIA GDGDIPHPGI GRARRMQVAR PAEQHNVMIE AVENHMPQVI VIDEIGTELE AQAARTIAER GVQLVGTAHG NTLENLMLNP TLSDLIGGIQ AVTLGDEEAR RRGTQKTVLE RKAPPTFDVL VEIQSWDDVT IYQEVASAVD SILQGNEPTA EQRTKDEQGE IVVHEGRPER LDSEASTMLT RRGGYRSRER DRDRDHGERD WRRKSERREA QREERERYSL ALNGSTSPKT EEPSVLVKKP GKNAPAKIFA FGVSRNRLEK ALDRLGITAS LVREMEQATM VITLKNYYRQ HPTRLRDAEE RGIPVYVLRS NTQTQMEECL GSAFEISISP SDPLSEAMEE VEEAISQVMD GSTESIELSP QSSYVRRLQH QIVERYNLQS ESTGKEPRRR IRIFR
|
| |