Gene Haur_5076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5076 
Symbol 
ID5737034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp93117 
End bp94595 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content45% 
IMG OID641282241 
ProductCRISPR-associated Cst1 family protein 
Protein accessionYP_001547832 
Protein GI159901586 
COG category 
COG ID 
TIGRFAM ID[TIGR01908] CRISPR-associated CXXC_CXXC protein Cst1 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTAAGG AAACTGGTCA TCCGCTGGTC GATGTAGGGT TTGCCACCAT TGCCGCCCAT 
GTCAACAAAA CCAACATCAA GGCCGTTACT GCCGCAGATC TCGAAAAGGT GGCGGCATAT
CTTGAGGCTG AGTTTGTGGT GAACCCATTG CGTTCATTTC TCACCGTCGC GTTTACCAGC
AATGCCTGGT TTATCCAAGA TGCTTACAAT CCTGATAAAC CTCAATTAAC TGATGAGCAG
CGCACAACCC GTCGGGCAAC CCGTAAAAAA TTGGCTGATG CCCATCTGCG TCAATGGCAA
CAACCAAGTA CCAGCGAGCA ACGTTGTGTT TTTACCGATG AGCCAGCAGT TGGCGAAGTT
TTATCAAATA CATTAACGGT TGGGCGGGCA GCACGTGCCC AAATTCCCTT ATTGCAGGGC
GACGCTACGA TCAACTTTTT TCCGAATGGC AATGCTGGCC TGCCAGTTTC GGGGATTGCT
CTCTTGGCGT TGCAAGCTTT CCCGCTGGGC AGCGCCAAGG CTGGCAATGG CATCTTAGTG
GTGCACTCGG CCAACCCACG CTTGACCTTA AATTTTGCCA AAACCTTTTA TCAGCGCAAT
AAAAGCAAAA TTGAACGAGC ACGAATTGCA GGCGATGATC GGGTTCCAGG CGAAGTACGC
TCACCAAAAA CCTTATTAAT TGAAACGTTG TTTGAAATTA CAGCCGAACG CGAAAATTAC
GATGACGACC AAATTACCAG TATTGTGGCC TACAACCTGA ACAATGGTAA AACGCCGAGC
TTAGCAATTT ATGAACTGCC CCATGAAATT ATTTTGTTCA TGCTCGAAGC TAAAGGGCAT
TTTCCTAAGC AATGGAATCA ATTAGTGCAT GGAGCTTGGG AACAAGCTTC GGCGGCAAGT
AAGAAATCAA ATCAACATTT TGAGCCACGG CGCAATTATT TTTATGAAGA TATGTTTGAT
CTACCAAATA ATGCCAAACG CTTTGTTCGT ACATATTTCC TCAAAATGCC CTATTTAGGC
ACGCTCGATG ATGATAATCA ATCACGCTAT TACTTACACG ATTATTATCA TTTGATTGAT
TTTCCAATTG TTGAACTTTT TTTAAGGAGG ATTTTCCTGA TGGACGACCT ACGGATCGCA
CGAATTAAGC ATTTTGGCGA TAAATTAGCC AATTATGTCC GTGCTGAAAA TGGCAAACGC
TTTTTTCGGG CTTTTGCCAG CGAGTATAAA TATCAAGATT TTCGGCAGCG CTTAATTAAA
GCGAGTGAAG ATTATGTACG CTCTAGCCAA GAACCCTTAT TTACCATTGA TGAATATGTC
GATGTTTTTG AACGCCAATA TAGCGACCAA GCTCCCGATT GGCGGTTATC CCGCGATCTG
GTGCTGATTC GCATGATTGA ACAATTAAAG GATTGGCTGA GTCGCAACCC CGATGCGATG
CCTGAGCGGC CTGAGCCAAA GCAGGAACAA TCGAATTAA
 
Protein sequence
MLKETGHPLV DVGFATIAAH VNKTNIKAVT AADLEKVAAY LEAEFVVNPL RSFLTVAFTS 
NAWFIQDAYN PDKPQLTDEQ RTTRRATRKK LADAHLRQWQ QPSTSEQRCV FTDEPAVGEV
LSNTLTVGRA ARAQIPLLQG DATINFFPNG NAGLPVSGIA LLALQAFPLG SAKAGNGILV
VHSANPRLTL NFAKTFYQRN KSKIERARIA GDDRVPGEVR SPKTLLIETL FEITAERENY
DDDQITSIVA YNLNNGKTPS LAIYELPHEI ILFMLEAKGH FPKQWNQLVH GAWEQASAAS
KKSNQHFEPR RNYFYEDMFD LPNNAKRFVR TYFLKMPYLG TLDDDNQSRY YLHDYYHLID
FPIVELFLRR IFLMDDLRIA RIKHFGDKLA NYVRAENGKR FFRAFASEYK YQDFRQRLIK
ASEDYVRSSQ EPLFTIDEYV DVFERQYSDQ APDWRLSRDL VLIRMIEQLK DWLSRNPDAM
PERPEPKQEQ SN