Gene Haur_0502 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0502 
Symbol 
ID5732416 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp583543 
End bp584790 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content53% 
IMG OID641277628 
ProductRNA-binding S1 domain-containing protein 
Protein accessionYP_001543281 
Protein GI159897034 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID[TIGR00717] ribosomal protein S1 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGAGC AAACACAACC GGGCATGGAT CAATCTGATG TGCTTCCCAG CACAGGATTG 
AACGCGACCC CACAGGCCGA TTATTCTGGC GATGATGATC GTGCCCTGCT GGAGGAGTAT
CTCCGTGATC CAGCCCACGA TTATCGCAAT CTCAAGTATG GTGATTCGGT CGATGGCACG
ATTGTTCGCG TTGATCGCGA CGAGGTGTTG GTCGATATTG GCTCCAAGTC GGAAGGCGTG
GTGCCAGGCC GCGAGATGAC CAGTCTGTCG TCGGAAGAAC GCGCCGAACT CAAGGTTGGC
GATGTCTTGC TTGTTACAGT CGTTCAAACC GAAGACGCTG AAGGGCGTAT CGTGTTGTCG
ATAGATAAAG CACGCCAAGA AAAGAGTTGG CGAGCCTTGC AAGTCAACCA TGAGGCTGGC
GATGTGATTC ACGCCGCCGT GACCAACTAT AACAAGGGTG GTCTGTTGGT TAATTTAAGT
GGGGTGCGTG GCTTTGTGCC ATCATCACAG GTCAGCAGCG TCAGCCGTGG CTCCGATGTC
CAAAAACAAT CGGATATGGC AAAACTGGTC GGCCAAACCT TGCCACTGAA AATTATCGAA
ATCAATCGTT CGCGCAATCG GCTGATTCTA TCCGAGCGCC AAGCCGTCCA AGAGGTTCGC
GATTCGCGCA AGGATCAACT GCTTGAAAAA CTGGAACCAG GCGCAGTTCG CACTGGCCGC
GTAACCAGTT TGTGCGATTT CGGCGCGTTT GTCGATATTG GCGGAGCAGA CGGTTTGGTT
CACCTTTCCG AGCTTTCTTG GAGCCGCGTC AAACATCCCG AGGAAGTGCT GAAAGTTGGC
GATGCAGTCA GCGTCTATAT TTTAAGCGTC GATGAAGATA AAAAACGCAT CGCGCTGAGT
ATCAAGCGCA CCCAAGCTGA GCCTTGGACA ACCGTTACCG ACCGCTACCA AATTGGCCAA
AGCGTTTCAG GGGTTGTTAC TCAATTGACC GCCTTTGGCG CGTTTGTCCG GCTTGAAGAT
GGCATCGAAG GTCTGATCCA CATCTCAGAA ATGAGTGATG AACGGATTCA GCACCCACGC
GATGTGATTA ATGAAGGCGA TAGCGTTTCA GCCCGCATTA TTCGGATCGA CCCAACGCGC
AAGCGGATTG GCTTGAGTAC CCGCAGTGGC AGCGCTGAAG CAACCGCTGA AGCAACTGCT
GAAACAGCAA CCGAAGAACC AAGCGCTGCA GCCGAAGACG AAGAATAA
 
Protein sequence
MDEQTQPGMD QSDVLPSTGL NATPQADYSG DDDRALLEEY LRDPAHDYRN LKYGDSVDGT 
IVRVDRDEVL VDIGSKSEGV VPGREMTSLS SEERAELKVG DVLLVTVVQT EDAEGRIVLS
IDKARQEKSW RALQVNHEAG DVIHAAVTNY NKGGLLVNLS GVRGFVPSSQ VSSVSRGSDV
QKQSDMAKLV GQTLPLKIIE INRSRNRLIL SERQAVQEVR DSRKDQLLEK LEPGAVRTGR
VTSLCDFGAF VDIGGADGLV HLSELSWSRV KHPEEVLKVG DAVSVYILSV DEDKKRIALS
IKRTQAEPWT TVTDRYQIGQ SVSGVVTQLT AFGAFVRLED GIEGLIHISE MSDERIQHPR
DVINEGDSVS ARIIRIDPTR KRIGLSTRSG SAEATAEATA ETATEEPSAA AEDEE