Gene Gura_0835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_0835 
Symbol 
ID5166021 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp992075 
End bp992998 
Gene Length924 bp 
Protein Length307 aa 
Translation table11 
GC content65% 
IMG OID640548333 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_001229616 
Protein GI148262910 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.855521 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGAAC CAATCCTCCC CCCATTGAAG CCGCTCCCCA TCAAGGACCG CATCTCGGTC 
GTTTACGTGG AACGGGGCAA CCTGGATGTC CTTGACGGCG CCTTTGTGGT CGTGGACAAG
ACCGGCGTCC GCACCCATAT CCCCATCGGC GGGGTGGCCT GCCTGATGCT GGAGCCGGGG
GCGCGGGTTT CCCACTCTGC CGTGGTGCTG GCGGCGCGGG TCGGGTGTCT GCTGGTCTGG
ATCGGCGAGG CCGGGGTGCG CATGTATGCC GCCGGTCAGC CGGGGGGTGC CCGGGCCGAC
CGGCTTTTGT ACCAGGCAAA GCTGGCCCTG GACGATACAT CGCGGCTGAA GGTGGTGCGC
AAGATGTACG CGATCCGCTT CCAGGAGGAG CCGCCGGAGC GGCGCAGTGT GGACCAGTTG
CGCGGTATCG AGGGGGTGCG GGTACGGAAA ATGTACGAGC TGCTGGCCCG GCAGCATGGG
GTGGAGTGGC AGCGCCGCAA TTATGATCAC AGCGAATGGG GGAGCGGCGA TGTGCCCAAT
CGCTGCCTTT CTTCGGCCAC CGCCTGCCTG TACGGCATCT GTGAGGCGGC CATCCTGGCG
GCAGGGTACG CCCCTGCGAT CGGTTTCATC CACACCGGCA AGCCCCAGTC ATTTGTCTAC
GACGTGGCCG ATATTTTCAA ATTCGAGACG GTGGTCCCGG TGGCGTTTCG TATCGCCGCC
AGGCAGCCCC GCAACCCGGA ACGCGAGGTG CGGCTGGCCT GCCGGGATGC CTTCCGTCAA
TCCAAGCTGC TGCAGCGGAT CATTCCTACA ATCGAGCAGG TGCTGGCGGC TGGCGGGCTG
GAGGTGCCGA AGGCCCATGA AGAGGCGGTA GTGCCCGCCA TTCCAAACAA GGAGGGCCTT
GGTGATCCGG GGCAAAGAGT TTGA
 
Protein sequence
MTEPILPPLK PLPIKDRISV VYVERGNLDV LDGAFVVVDK TGVRTHIPIG GVACLMLEPG 
ARVSHSAVVL AARVGCLLVW IGEAGVRMYA AGQPGGARAD RLLYQAKLAL DDTSRLKVVR
KMYAIRFQEE PPERRSVDQL RGIEGVRVRK MYELLARQHG VEWQRRNYDH SEWGSGDVPN
RCLSSATACL YGICEAAILA AGYAPAIGFI HTGKPQSFVY DVADIFKFET VVPVAFRIAA
RQPRNPEREV RLACRDAFRQ SKLLQRIIPT IEQVLAAGGL EVPKAHEEAV VPAIPNKEGL
GDPGQRV