Gene Tneu_0578 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_0578 
Symbol 
ID6164759 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp528753 
End bp529664 
Gene Length912 bp 
Protein Length303 aa 
Translation table11 
GC content65% 
IMG OID641667731 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_001793966 
Protein GI171185047 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0010415 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0764761 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGGTGG TAGTTAAGGA GCACGGGGTG TCGCTGGGCT ACAGCAGGTG GGCCCTGGTG 
GTGAGGAGGA GGGGCGGGGC CGCCGAGAGG ATCCCCATAC ACCAAGTGGA CAGGCTGTGG
ATCCTCACTG GGGGCGTCTC CATCTCGTCT AGGCTAGTGA GGGCCCTGGC GAGGAGCTTC
GTAGATGTGG TGTTTTTCGA CGGCAGAGGC AACCCCGCGG CTAGGCTGTT TCCGCCTGAG
GCAAACGGCA CGGTTGCACA CCGGCGGGCT CAGTACGAGG CCTACCTAAA CGGCAGGGGG
CTCGAGCTGG CCAAGCTGGT GGTGTATGGA AAGATCGTGA ACCAAGCGGC GGCGCTTAGG
AGAGCCGGCT TGTGGAGGAG GGAGCTGTAC CAAGAGCTCG CGGGGGCTGC GTCTAGGGTG
GCAGAGGCGG CGGCCGCGGT CCCCCGGTGC GGAGACCCAC AGTGCGTCCT CGGCCACGAG
GGGCGCGCCG CGGCTGAGTA CTGGGCCGCC CTCTCAAAGG CCTTTGGGAC CCCCACGAGA
GATCCAAACG CCTCCGACCC CTTCAACCTC GCGCTTAACT ACGGCTATGG GATACTCCGC
TACGCCGTGT GGAGACAGGC AGTTATCCAC GGCCTCGACC CCTACGCCGG CTACCTTCAC
GTGGATAAGT CGGGGAGGCC CTCCCTAGTG CTGGACCTAA TGGAGGAGTT CAGACCCCAC
ATAGACCTCA TGGTGCTCAA GGCCAAGCCC TCCGCAGACT GGCTAGAGGG CGGGGTCTTG
AAGCGGGAGG CCAGGGCGGC GCTGGTGGAG AAGTGGCTCG AGATGAGGCT CGAGCCCACC
ATAGCGAGGC AGGTGGGGCT GGCGGTGGCC CACCTAGAGG GCAGGGGGGT ATACACGCCG
CATAAGCTAT GA
 
Protein sequence
MEVVVKEHGV SLGYSRWALV VRRRGGAAER IPIHQVDRLW ILTGGVSISS RLVRALARSF 
VDVVFFDGRG NPAARLFPPE ANGTVAHRRA QYEAYLNGRG LELAKLVVYG KIVNQAAALR
RAGLWRRELY QELAGAASRV AEAAAAVPRC GDPQCVLGHE GRAAAEYWAA LSKAFGTPTR
DPNASDPFNL ALNYGYGILR YAVWRQAVIH GLDPYAGYLH VDKSGRPSLV LDLMEEFRPH
IDLMVLKAKP SADWLEGGVL KREARAALVE KWLEMRLEPT IARQVGLAVA HLEGRGVYTP
HKL