Gene Tneu_1140 details


Gene Information

Locus tag: Tneu_1140
Symbol:
ID: 6165863
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoproteus neutrophilus V24Sta
Kingdom: Archaea
Replicon accession: NC_010525
Strand:
Start bp: 1028728
End bp: 1029600
Gene Length: 873 bp
Protein Length: 290 aa
Translation table: 11
GC content: 67%
IMG OID: 641668291
Product: CRISPR-associated Cas1 family protein
Protein accession: YP_001794516
Protein GI: 171185597
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
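
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 1028728
END_BP = 1029600
GENE_LENGTH_BP = 873
PROTEIN_LENGTH_AA = 290


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 873
print(expected_protein_length(GENE_LENGTH_BP))  # 290
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.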


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.511238
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000356789
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGGCCGCAT ACGGCGCCAG GATAAGGGCT AGGAAGGGCC TCCTCCTCGT GGAGACAAAG 
GAGGGCGCCA GGGAGTACCC CCTACACGAG GTAGACGAGG TCCTCCTACT CACCGGCGGC
ATATCCATAA CGACGAGGGC GCTCAGGGCC CTCCTCGCCG CCGGGGCCAC AGTCGCCGTC
TTCAGCCCCC GCGGGGAGCC CCTGGGCATA TTCATGAAGC CCATCGGAGA CGCCACGGGG
GCCAAGAGGA GGTGCCAGTA CAAGGCGGCG GAGGACGGCA GAGGGCTACA GTACGCCAAG
AGCTGGGTCT TCAAGAAGAT GCTGGGCCAG AGAGACAACA TCAAGGCCTG GCGCCGCCGC
CTAAGAGGCT ACAGCCAATA CGCCGAGTCC CTAGCCAAGG CCCTACAGGC GCTGAGAGAC
GCCGCCTCCC CCCACGCTGT CTTGGAGGCC GAGGCGGCGG CCGCCGAGGC CTACTGGGCC
GCCTACAGGG AGGTCACGGG GTTCCCCGGC AGAGACCAGG AGGGGAGAGA CCCCGTCAAC
GCCGGCCTAA ACTACGGCTA CGGGATCTTG AAGGCCCTGG TCTACAAATC CCTGATCCTC
GCCGGGCTGG ACCCATACGT CGGCTTCCTC CACGTAGACA AATCCGGGAG GCCCTCCCTA
GCGCTGGACT TCATGGAGCA GTGGAGGCCC CGCGTCGACG CCGTCGTGGC CAAGATGGCG
GACAAGCTGG AGTCCGAGGG CGGCCTACTC ACCCGCCGGT CCCGCCTGGA GCTGGCCGCC
GCCGTCCTGG AGGAGCTCCA CGCCGCCAAG AGGCCCCTCT CCGCCGAGAT CCACAGAGAG
GCCAGAGCTC TGGCGCGCTC CATATGTACA TAA
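
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted with the space-separated 10-base groups shown above. The helper names `clean` and `gc_percent` are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


# First row of the gene sequence above; paste all rows to check the full gene.
first_row = "GTGGCCGCAT ACGGCGCCAG GATAAGGGCT AGGAAGGGCC TCCTCCTCGT GGAGACAAAG"
print(len(clean(first_row)))
print(round(gc_percent(first_row), 1))
```

Applying `gc_percent` to the complete sequence should give a value comparable to the GC content field, provided that field is computed the same way.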
 
Protein sequence
MAAYGARIRA RKGLLLVETK EGAREYPLHE VDEVLLLTGG ISITTRALRA LLAAGATVAV 
FSPRGEPLGI FMKPIGDATG AKRRCQYKAA EDGRGLQYAK SWVFKKMLGQ RDNIKAWRRR
LRGYSQYAES LAKALQALRD AASPHAVLEA EAAAAEAYWA AYREVTGFPG RDQEGRDPVN
AGLNYGYGIL KALVYKSLIL AGLDPYVGFL HVDKSGRPSL ALDFMEQWRP RVDAVVAKMA
DKLESEGGLL TRRSRLELAA AVLEELHAAK RPLSAEIHRE ARALARSICT
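
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below builds the codon table by hand; table 11 shares its amino-acid assignments with the standard code and differs in the set of permitted start codons. `ALT_STARTS` is an assumed, incomplete subset of those alternative start codons, and `translate` is an illustrative helper rather than the database's own procedure.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the alternative start codons that are read as
# methionine when they initiate translation.
ALT_STARTS = {"GTG", "TTG"}


def translate(seq: str, initiator: bool = True) -> str:
    """Translate an in-frame nucleotide sequence up to the first stop codon."""
    s = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and initiator and codon in ALT_STARTS:
            aa = "M"  # alternative start codon initiates with methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGGCCGCAT ACGGCGCCAG GATAAGGGCT"))  # MAAYGARIRA
```

The gene begins with GTG rather than ATG, which is why the initiator codon needs special handling to reproduce the leading M of the protein sequence.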