Gene Aboo_1028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAboo_1028 
Symbol 
ID8827985 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAciduliprofundum boonei T469 
KingdomArchaea 
Replicon accessionNC_013926 
Strand
Start bp987756 
End bp990023 
Gene Length2268 bp 
Protein Length755 aa 
Translation table11 
GC content39% 
IMG OID 
ProductCRISPR-associated protein, Csm1 family 
Protein accessionYP_003483399 
Protein GI289596703 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00145134 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGATA TCCAAGAAAA AGTTCTGGTT TTATCTGCTC TTTTGCATGA TATAGGAAAA 
TTGAATCAAA GGGCTTCTAC AAGGTTCAAA GAGAAGAAAC ACGAAGTCTT CGGCGCTGAA
TTTTTGAATG AGAACCTGAA TGCTAGTCCA GAGATAAAAG ATGAAACCAT TAAGCTTGTG
ATTTCTCATC ATCAAATATC TTACGAGGGA AAATACCCGG AGCTTTTGAA AATACTGAAG
GACTCGGATG GGGAATCGGC AGAGCACGAT AGAATGGAGT CTGAGGGTAT AGATACAAAT
ATCAACCAAA CTCCACTCAT ATCGGTTTAT TCCTATATAG ACATAGGATT TGGAAGCAAG
GGGAAGGTAG GTTATTCTTT GGCTTCACTT GAGAGAAAAC TCCAGTATCC AAGTACATAT
GCAACGAACT CCCAAGTTCT ATACGGTTCA ATAGAAGACA AGCTAAAGAG AGAATTAAAA
AACCTTACCA TTCAAGGCAA GGAGGATATA AACACACTCC TCTACATTCT TTTGAAATAC
ACCAAGTATG TACCATCGGC CATCTATGTT AGCGTCCCAG ATATCCCTCT ATACGACCAT
CTAAAAACCA CGGCTGCCAT AGCACTATGC AAGTACAGGA GCAATAAAAA AGAGAAGTAC
ACAATTATAA TGGGTGATTT ATCTGGGATA CAAAATTTTA TATTTTACAA TTTAAAAGGT
GGCGAAGAGC AGGTTGTGGA TGAGAAAGGT ACTAAAAGAA TGCGTGGAAG GAGCTTGCTG
ATAAACTTAA TAATAGATTC TGCAGTGAGG TATATACAAG AAGAGCTTGA CTTGTATGAT
TTCAATATCC TATGGCAAAG CGGTGGAAAT TTCCTGATGC TTGTTCCTAA TGTAGACGGC
ATAGACGAGA AGCTAGTGGA AATGAAGAGA AAGGTGAATG AGTTCCTCTT GAACGAATTT
GGAAGGCTAT ACCTCAATAT TGCTTGGATT CACAAGGATA ACTTGAACAA TTTCAGCGAA
ATACTGTATG AATTGCATTC AAAGATGGAC GAAGAGTCAA AGTCCAAGAA ATATATTGAG
TTTGTGAGAG ATGAGGATTT CTATGTTAGC CCTTCAAGAG ATAAATACAT ATGCCCAGTT
TGTGGAGTGC ATTATGTGAA TGATTCAAAT GGGATATGTG ATACCTGCCA GAGAACAGCG
GAGCTTGGGG GATTCATCGG AAAAGGCCAG TATTTGATAA GAAGCTTAGG CAAGGATGGA
CATTTCACAT TCAAATATGG AGATCTCAGA ATTTCTTACA CAATATCGGA GGATTTCTAT
GGATACGAAG ATGATGAAGT GTTTTCTATA GAGGATTTCA ACATACCTTC GGTGGGTAAG
GTTAGAGGGT TCAAACTCTT GAAAACATAC ATTCCCAGTG TATATGGCAA AGTCCTTAGC
ATCTCGAAGC TTTTAGCTCC CGGCACTTCC TTAGAAGATA GAATGACCTC TAACGCCACT
ACAAAGATGG GAATTTTCAA AGCGGATGTT GATTGCTTAG GTGAGATATT CAAGGAAGGT
TTCAAAGAAG ATCTGAGGAG AATAAGCAGG ATATCAACCC TTTCTTTCTT GATGGACCTG
TTTTTCTCCG TGGAAGTTAA CAATATTGCT AAGAAGAACA ACATTTATGT TATATTCTCT
GGTGGGGATG ACCTTACAGC AGTGGGCAGG TACGATGAGA TAATTAAATT TGCGTTGGAT
GTGCGAGATT CCTTTGCCAA GTGGACAGGG GGTAATGAGA ATTTGCATAT ATCCGCATCT
ATAGTTTTCT TTGATGAGAA ATTTCCTATT CGCAGGGCTG TTAATGTTGC TGAAGAGCAT
CTTGGAGATG CAAAGGATTA CTCTGCAGAT GTGAGCATGT GCGTAGAGAG AGGAAATAAG
ATAAAGATCT TTGAAGATGT ATTGTCTTGG GACGATTTCA AAGCACAGGT GGACATGGGC
AACGAACTTT GGGAAGCGCA GAAAAAAGGA GAAATTTCAT CATCTTTATC GCATGTTCTC
TTGGTTCTCC ATAAGCTTAG TCCTTCATTC TTTGCAGATA AACCTCTGGG GAAAGGAGAT
GTTGTTATTC CCAGTCCAAA GCCATATTTG AGGTATTATT TTGCTCGTAG GAAAGGAAGT
AGATACGATT TGCTTGATAA ATTATCCCAA GAGGGTATTT TCAAGCACAT TCCAGTTGGA
GTGTCAATAT GGGTTATGAG TAGAAAGTAT GGAAAAGAGG TGAGTTAG
 
Protein sequence
MNDIQEKVLV LSALLHDIGK LNQRASTRFK EKKHEVFGAE FLNENLNASP EIKDETIKLV 
ISHHQISYEG KYPELLKILK DSDGESAEHD RMESEGIDTN INQTPLISVY SYIDIGFGSK
GKVGYSLASL ERKLQYPSTY ATNSQVLYGS IEDKLKRELK NLTIQGKEDI NTLLYILLKY
TKYVPSAIYV SVPDIPLYDH LKTTAAIALC KYRSNKKEKY TIIMGDLSGI QNFIFYNLKG
GEEQVVDEKG TKRMRGRSLL INLIIDSAVR YIQEELDLYD FNILWQSGGN FLMLVPNVDG
IDEKLVEMKR KVNEFLLNEF GRLYLNIAWI HKDNLNNFSE ILYELHSKMD EESKSKKYIE
FVRDEDFYVS PSRDKYICPV CGVHYVNDSN GICDTCQRTA ELGGFIGKGQ YLIRSLGKDG
HFTFKYGDLR ISYTISEDFY GYEDDEVFSI EDFNIPSVGK VRGFKLLKTY IPSVYGKVLS
ISKLLAPGTS LEDRMTSNAT TKMGIFKADV DCLGEIFKEG FKEDLRRISR ISTLSFLMDL
FFSVEVNNIA KKNNIYVIFS GGDDLTAVGR YDEIIKFALD VRDSFAKWTG GNENLHISAS
IVFFDEKFPI RRAVNVAEEH LGDAKDYSAD VSMCVERGNK IKIFEDVLSW DDFKAQVDMG
NELWEAQKKG EISSSLSHVL LVLHKLSPSF FADKPLGKGD VVIPSPKPYL RYYFARRKGS
RYDLLDKLSQ EGIFKHIPVG VSIWVMSRKY GKEVS