Gene Acry_1808 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcry_1808 
Symbol 
ID5161899 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidiphilium cryptum JF-5 
KingdomBacteria 
Replicon accessionNC_009484 
Strand
Start bp1991304 
End bp1992266 
Gene Length963 bp 
Protein Length320 aa 
Translation table11 
GC content61% 
IMG OID640553724 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_001234929 
Protein GI148260802 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCTGATC AGGCGCCGGG TCTGCCGCCG CCGCGACCCA TTCCAATTCG TGAACGCTCA 
TCTATCCTGT TTCTCGAGCG CGGACAGCTC GACGTGATCG ATGGGAGCTT TGTTGTCGTC
GACCAGCGTG GCATTCGGAC CGTAATTCCG GTCGGCGGCA TTACATGCCT GATGCTGGAG
CCAGGCACAC GTGTAAGCCA TGCCGCTGTT TGCCTAGCTG CACGAGCGGG CACATTACTC
ATCTGGGTTG GAGAGGCTGG TGTGCGCCTC TACGCTGCCG GCCAGCCGGG TGGTGCGCGG
TCCGACCGGC TACTCTATCA GGCGCGACTT GCGCTCGATG ATGAGGCTCG TCTCCGTGTC
GTCCGAAGAA TGTATTCTAT CCGCTTCCAG GAAGAGCCGC CCCAGCGCCG CTCTGTCGAT
CAACTGCGCG GCATTGAGGG CGTCCGCGTG CGGGAGCTCT ATAAGCTGAT GGCCCAGCGC
CATGGCGTCT CCTGGGCCGG GCGGCGTTAC GACCCCCAGA ACTGGGGTGG TGCTGATCTT
GTCAATCGTT GTCTATCTGC TGCGACAGCT GCCCTATACG GGATCTCGGA GGCGGCAATC
CTGGCCGCCG GGTATGCGCC GGCGATCGGC TTTTTACATA GTGGCAAGCC GCAAAGCTTT
GTTTATGACA TTGCGGATCT CTTCAAGTTC GAAACGGTCG TGCCGGAGGC ATTCAAGGTA
GTGGCGGCAG TGCAATCTGG ACGTGGACAG ATTGATGGGA TGGCGATCGG AGAACCTGTC
GGCGCGGTCC GGCGGCGCTG CCGCGATGCG TTCCGCCGCA CAAATGTACT CGCACGCATC
ATTCCCGCTA TCGAGGATGT GTTGGCAGCC GGCGGCTTGG CAGTGCCAGA GGCGCCCGAA
GAAGCGATGC CCGTTGCTAT ACCACCACCG GAACCGACGG GTGATGCTGG TCATCGTGGT
TGA
 
Protein sequence
MSDQAPGLPP PRPIPIRERS SILFLERGQL DVIDGSFVVV DQRGIRTVIP VGGITCLMLE 
PGTRVSHAAV CLAARAGTLL IWVGEAGVRL YAAGQPGGAR SDRLLYQARL ALDDEARLRV
VRRMYSIRFQ EEPPQRRSVD QLRGIEGVRV RELYKLMAQR HGVSWAGRRY DPQNWGGADL
VNRCLSAATA ALYGISEAAI LAAGYAPAIG FLHSGKPQSF VYDIADLFKF ETVVPEAFKV
VAAVQSGRGQ IDGMAIGEPV GAVRRRCRDA FRRTNVLARI IPAIEDVLAA GGLAVPEAPE
EAMPVAIPPP EPTGDAGHRG