Gene Haur_0499 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0499 
Symbol 
ID5732413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp579618 
End bp581315 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content54% 
IMG OID641277625 
Productsingle-stranded nucleic acid binding R3H domain-containing protein 
Protein accessionYP_001543278 
Protein GI159897031 
COG category[S] Function unknown 
COG ID[COG3854] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR02858] stage III sporulation protein AA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000861102 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAGATC GTCGCGAAAT TACCCATAAT ATCGATCTGT TGCTGTCAAC CTTGCCGCCA 
CGTTTGGCCG AGCCGTTGGC AACCCATGAA CAGAAAGATC AAGTTATAGA AATTGTCATG
GATTTGGGGC GCTTGCCCGA GGCGCGTTTT CGCCACGACC AATCAAGTTT CCTTAGCGAA
ACTGAGGTCT CGCGCGAAGA TCTCGATTAT GTGACCGAGC GGATTGGTCA ATTTGGTGAA
GATAATCGCG CTGGGATTCA GCGCACGCTG CACCGCATTT CAGCCATTCG CAACCGCTCT
GGGGTGGTGA TCGGTCTGAC CTGTCGGGTT GGCCGCGCCG TTTATGGCAC AATCGAAATC
GTGCGCGATT TGGTCGAGGC TGGTAAATCG ATCTTGATTC TGGGCAAGCC TGGGACAGGT
AAAACCACCA TGCTGCGCGA AGTCGCCCGC GTCTTGGCCG ACGACTTTCT CAAGCGGGTG
GTGATCGTCG ATACCTCGAA CGAAATCGCT GGCGATGGCG ATATTCCTCA CCCAGGGATT
GGTCGCGCTC GGCGAATGCA AGTAGCTCGG CCAGCCGAAC AGCACAATGT GATGATCGAA
GCGGTCGAAA ACCACATGCC CCAAGTCATT GTGATCGATG AAATTGGGAC GGAGTTGGAA
GCTCAAGCTG CTCGCACCAT CGCCGAACGT GGGGTGCAAT TGGTTGGTAC GGCTCACGGG
AATACCTTGG AAAACCTGAT GCTCAACCCA ACTCTCTCCG ATCTGATCGG CGGGATTCAG
GCAGTAACCT TGGGCGATGA AGAAGCTCGA CGGCGGGGAA CCCAAAAAAC CGTGCTCGAA
CGCAAAGCGC CGCCAACCTT CGATGTGTTG GTCGAAATTC AAAGCTGGGA CGATGTAACG
ATTTATCAAG AAGTCGCTTC AGCCGTCGAT TCGATTTTGC AGGGCAATGA GCCAACCGCC
GAACAACGCA CCAAGGACGA GCAAGGCGAA ATTGTGGTGC ACGAAGGCCG CCCAGAACGG
CTTGATAGCG AAGCTAGCAC CATGCTCACG CGGCGTGGCG GCTATCGCAG CCGTGAACGC
GACCGTGATC GTGACCATGG CGAACGCGAC TGGCGGCGTA AAAGCGAACG GCGTGAAGCC
CAACGCGAAG AGCGCGAACG CTATAGCTTG GCACTGAATG GCAGCACTAG CCCAAAAACC
GAAGAACCAA GCGTGCTGGT TAAAAAGCCT GGTAAAAATG CGCCAGCCAA GATTTTTGCC
TTTGGGGTTA GCCGCAATCG CTTGGAAAAA GCCTTAGATC GTTTGGGCAT TACTGCTAGT
TTGGTGCGCG AAATGGAACA AGCCACGATG GTGATTACGC TGAAAAATTA CTATCGTCAG
CACCCAACTC GGCTGCGTGA TGCCGAAGAA CGTGGAATTC CAGTGTATGT GCTACGCTCG
AATACCCAAA CCCAAATGGA AGAATGCCTC GGCTCGGCCT TCGAAATCAG CATCAGTCCT
AGCGACCCGC TGAGCGAGGC CATGGAAGAG GTTGAAGAAG CAATCAGTCA AGTCATGGAT
GGCTCAACCG AAAGTATCGA ACTGAGTCCG CAAAGTTCCT ATGTGCGTAG GTTGCAACAT
CAGATTGTTG AGCGGTACAA TCTACAATCA GAAAGTACTG GCAAAGAACC ACGTCGCCGC
ATTCGGATCT TTCGCTAA
 
Protein sequence
MLDRREITHN IDLLLSTLPP RLAEPLATHE QKDQVIEIVM DLGRLPEARF RHDQSSFLSE 
TEVSREDLDY VTERIGQFGE DNRAGIQRTL HRISAIRNRS GVVIGLTCRV GRAVYGTIEI
VRDLVEAGKS ILILGKPGTG KTTMLREVAR VLADDFLKRV VIVDTSNEIA GDGDIPHPGI
GRARRMQVAR PAEQHNVMIE AVENHMPQVI VIDEIGTELE AQAARTIAER GVQLVGTAHG
NTLENLMLNP TLSDLIGGIQ AVTLGDEEAR RRGTQKTVLE RKAPPTFDVL VEIQSWDDVT
IYQEVASAVD SILQGNEPTA EQRTKDEQGE IVVHEGRPER LDSEASTMLT RRGGYRSRER
DRDRDHGERD WRRKSERREA QREERERYSL ALNGSTSPKT EEPSVLVKKP GKNAPAKIFA
FGVSRNRLEK ALDRLGITAS LVREMEQATM VITLKNYYRQ HPTRLRDAEE RGIPVYVLRS
NTQTQMEECL GSAFEISISP SDPLSEAMEE VEEAISQVMD GSTESIELSP QSSYVRRLQH
QIVERYNLQS ESTGKEPRRR IRIFR