Gene Haur_5081 details

Gene Information

Locus tag: Haur_5081
Symbol:
ID: 5737039
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009973
Strand:
Start bp: 99827
End bp: 100840
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 47%
IMG OID: 641282246
Product: CRISPR-associated Cas1 family protein
Protein accession: YP_001547837
Protein GI: 159901591
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
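
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 99827
end_bp = 100840

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1014 0 337
```

Under these assumptions the result agrees with the listed Gene Length (1014 bp) and Protein Length (337 aa).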


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAATTAA TTGTCCATGA ACGGGGCACC TTTATTCAAA AACATCAAGG TCGTTTGCGG 
GTCATGCGTG AAAAAGAGCG TTTGGCTGAA GTTCCATTAT TAATTCTTGA TCACGTTATT
ATCGAATCGT ATGGTGTTGG AATTTCATCC GATGCAGTGC GAGCGTGTGC TGAGCATGGC
ATCCCGATTC ATTTTTTAAG TAGTACAGGT ATTGCCTATG CTTCGCTCTA TAGTGCCGGA
TTAACGGGTA CGGTGCAAAC ACGCCGTGCC CAATTACAAG CTTTTGAAAA TGAGCGCGGT
GCATGGCTAG CCCGTGCATT TGTCTGTGGC AAATTAGAAA ATCAGCATAA TTTGCTCCGA
ATGATGGCCA AATATCGCAA AACGGCTGAT CCTGCTTGTT TTCAGCGGGT TCAGCCAATT
ATCGCCGAAA TGCGTGATCA TATTATCGAA GCTGAGCGGG TTATGCCGCA GCAACTTGAG
CACATTCGGC CTTCACTGTT GAGTATCGAA GGTCGCGGCG CGGCTCGTTA TTGGTTTGGT
GTGCGTGAAT TATTGCTCTG TGATTTAGAT TGGCCTGGGC GTGAAACCCA AGGAGCACGT
GATCCGCTCA ATAGTGCCTT GAATTATGGC TATGGCATTT TGTATAGCCA AATCGAGCGT
TGCCTCGTTC TGGCTGGCCT TGATCCCTAT GGTGGCTTTA TGCACACTGA TCGGCCTGGT
AAACCATCAT TAGTGCTCGA TTTGATTGAA GAATTTCGCC AAACCGTGGT TGATCGTACC
ATTTTGGGTT TGGTCAATCG CAAAATGACG ATTGAGCAAG ATGAAACTGG CCGATTAAGC
GACCATACAC GCGAGATGAT TCGTGAGCGC CTATTTAAGC GTTTGGAAGC GAGTGAGCCA
TATGAGACCA AACGGGTGAG TTTGCGGGTA ATTATGCAAT CTCAAGCTCG CCATCTCGCA
ACATTTGTGC GCGGCGATCG CGACACCTAC ACACCATTTA TTGCCTCGTG GTAG
 
Protein sequence
MELIVHERGT FIQKHQGRLR VMREKERLAE VPLLILDHVI IESYGVGISS DAVRACAEHG 
IPIHFLSSTG IAYASLYSAG LTGTVQTRRA QLQAFENERG AWLARAFVCG KLENQHNLLR
MMAKYRKTAD PACFQRVQPI IAEMRDHIIE AERVMPQQLE HIRPSLLSIE GRGAARYWFG
VRELLLCDLD WPGRETQGAR DPLNSALNYG YGILYSQIER CLVLAGLDPY GGFMHTDRPG
KPSLVLDLIE EFRQTVVDRT ILGLVNRKMT IEQDETGRLS DHTREMIRER LFKRLEASEP
YETKRVSLRV IMQSQARHLA TFVRGDRDTY TPFIASW
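
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any length or composition check. The following is a minimal sketch of that step; the helper names `clean_sequence` and `gc_percent` are hypothetical and do not come from the source database.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used only as a short demonstration.
first_line = "ATGGAATTAA TTGTCCATGA ACGGGGCACC TTTATTCAAA AACATCAAGG TCGTTTGCGG"
print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```

Passing the complete gene sequence to the same helpers gives values that can be compared with the Gene Length and GC content fields of the record.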