Gene Haur_3713 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3713 
Symbol 
ID5735577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4670188 
End bp4671381 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content54% 
IMG OID641280865 
ProductRNA-binding S1 domain-containing protein 
Protein accessionYP_001546477 
Protein GI159900230 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID[TIGR00717] ribosomal protein S1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000276005 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGTGG ACACCGAAAG CATGGATGCC GTTCAAGATT GGACGACACT GCTTGCTGAG 
TACGATTATC AACGCCCAGA ACGCGGTCAA TTGCGTGAAG GTATCGTCAT GCGCGTCGAA
GACAGCCAAA TCTTGGTTGA CATTGGCGCA AAACACGAGG GCGTTATCCC TAACCAAGAT
CTGCGCCGTT TGCCGCCAGA ATTGGTCAGT GGGATCAAAA ACGGTGACAC ACTGCAAGTG
TACGTGATGG AGCCAGAATC AAAAGAAGGC GAACTGGTTC TTTCATTGAA CATGGTGCAG
GTCGAGCGCG ATTGGCAAGA AGCCCAAACG ATGCTCGAAA ATGGTCAAAT CATCGAAGCT
GGCGTGGTCG GCTACAACAA GGGCGGTTTG TTGGTTCAAG TTGGGCGTGT GCGCGGTTTC
GTACCAGCCT CACAAGTGGT CAACTTGCAC AGCCGCACTG GCACTGAAGG CCAACAAAGC
GCCATGACCA AGATGGTCGG CCAAAATATT CCTTTGAAAG TTATCGAAGT TGATCGCGAT
CGCAATCGTT TGGTGCTTTC AGAGCGTGCC GCTATGCAAC GCTGGCGACA ATCGCAAAAG
GAACGCTTGC TCGAAACCCT CGAACCAGGC GCAGTCGTCA CTGGTCGGGT CAACCAACTC
ACTCCATTCG GTGCTTTCAT CGATTTGGGC GGTGCTGATG GTTTGGCTCA CATCTCAGAG
CTTTCATGGC AGCGCGTCAA CCACCCACGC GAAGTCTTGC AACCAGGCCA AGAAGTCCAA
GTATACGTCT TGGAAGTCGA TCGCGATCGC GAACGGATTG GCTTGAGCTT GCGCCGTTTG
CAGCCAGATC CATGGGCAAC CATCGATCAA CGCTACGACC TCGGCCAATT GATCGTTGGT
GAAGTAACCA ACATCGCTCC TTTCGGCGTG TTTGTACGCG TTGAAGAAGG CGTTGAAGGT
TTGATCCACG CTTCAGAATT GACCGAAAAC GGCCAATCGC CCGACTCGTT GCAACAAGGC
CAACAAGTGC AAGTGAAGGT GATCAGTCTT GATCGCCAAC GCCAACGCCT TGGCTTGAGC
TTGCGCCGCG TCGATGGCGA AGGTGAAGCC GCCGAAGCAC CAGCAGCTCC TGTAGCTGAA
GTGGTCGCCG AAGCCGCTAC CGAAGCTACC ACCGAAGAAG AAGTCGGCGC ATAA
 
Protein sequence
MTVDTESMDA VQDWTTLLAE YDYQRPERGQ LREGIVMRVE DSQILVDIGA KHEGVIPNQD 
LRRLPPELVS GIKNGDTLQV YVMEPESKEG ELVLSLNMVQ VERDWQEAQT MLENGQIIEA
GVVGYNKGGL LVQVGRVRGF VPASQVVNLH SRTGTEGQQS AMTKMVGQNI PLKVIEVDRD
RNRLVLSERA AMQRWRQSQK ERLLETLEPG AVVTGRVNQL TPFGAFIDLG GADGLAHISE
LSWQRVNHPR EVLQPGQEVQ VYVLEVDRDR ERIGLSLRRL QPDPWATIDQ RYDLGQLIVG
EVTNIAPFGV FVRVEEGVEG LIHASELTEN GQSPDSLQQG QQVQVKVISL DRQRQRLGLS
LRRVDGEGEA AEAPAAPVAE VVAEAATEAT TEEEVGA