Gene Ajs_0489 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAjs_0489 
Symbol 
ID4672324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidovorax sp. JS42 
KingdomBacteria 
Replicon accessionNC_008782 
Strand
Start bp511755 
End bp512759 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content61% 
IMG OID639837619 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_984816 
Protein GI121592920 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03637] CRISPR-associated endonuclease Cas1, YPEST subtype 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.121384 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGACA TTGCTGCCGG TGACCTCAAG GCCATACTGC ATTCCCAGAG GGCCAACATC 
TACTATCTGG AGCACTGCCG GGTGCTCGTC AATGGCGGCC GCGTCGAGTA CGTGACCGAC
GCGGGCAAGC GAAGCCTGTA CTGGAACATT CCCATTGCCA ACACCACGAG CATCTTGCTG
GGGACGGGCA CCTCGATCAC GCAGGCCGCC ATGCGAGAAC TGGCGAAGGC AGGCGTACTG
GTGGGCTTCT GCGGTGGTGG CGGTACGCCG CTTTTCGCAG CCAATGAAGT CGACGTAGAA
GTAGCTTGGC TCACACCGCA GAGTGAGTAC CGGCCCACCG AATACCTGCA GTACTGGGTG
AAGTTCTGGT TCGATGAGGA ACTGCGCCTG CACGCCGCGA AGCAGTTGCA GACCTGGCGA
CTGCAGCGGC TGGCGGCCGA ATGGAACAGC CGCTCTTTGC GTGAAGCCGG TTTTTCCATC
GATCTGGGTC GTTTGCAGAG CCTGGTGCAG CAGTTCACAC CACTGATCAG CAACGCCCCT
GATGTGACAG CGCTGCTGAC TGACGAGGCA CGCTTGACCA AGGCCCTCTT CAGGCTGGCC
GTGGAAGCCG TGGGCTACGG GGAATTCACG CGGGCCAAAC GCGGAACGGG TACCGACGCT
GCGAACCGCT TCCTTGACCA TGGCAACTAT CTGGCCTATG GCCTGGGCGC TACCGCGACC
TGGGTGTTGG GCCTGCCGCA CGGGCTGGCG GTATTGCACG GCAAGACGCG ACGGGGTGGT
CTGGTATTCG ACGCAGCAGA CTTGATCAAG GATGCAGCCA TCCTGCCGCA AGCCTTTCTG
TCGGCGATGC GGGGCGATGA TGAACAGCAG TTCCGCCGCC AATGCATAGA GGCGTTGACG
CGCAGCGAAT CATTGGATTT CATCATCGAC ACACTCAAAC ACATCGCCAG CACGACGTCG
CGGCTGGCCG ACGCGCCCCC TCCTGTTCGG GAGCCAAATG CATGA
 
Protein sequence
MEDIAAGDLK AILHSQRANI YYLEHCRVLV NGGRVEYVTD AGKRSLYWNI PIANTTSILL 
GTGTSITQAA MRELAKAGVL VGFCGGGGTP LFAANEVDVE VAWLTPQSEY RPTEYLQYWV
KFWFDEELRL HAAKQLQTWR LQRLAAEWNS RSLREAGFSI DLGRLQSLVQ QFTPLISNAP
DVTALLTDEA RLTKALFRLA VEAVGYGEFT RAKRGTGTDA ANRFLDHGNY LAYGLGATAT
WVLGLPHGLA VLHGKTRRGG LVFDAADLIK DAAILPQAFL SAMRGDDEQQ FRRQCIEALT
RSESLDFIID TLKHIASTTS RLADAPPPVR EPNA