Gene Tgr7_2433 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTgr7_2433 
Symbol 
ID7317135 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThioalkalivibrio sp. HL-EbGR7 
KingdomBacteria 
Replicon accessionNC_011901 
Strand
Start bp2578761 
End bp2580473 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content63% 
IMG OID643617337 
ProductCRISPR-associated Cas1/Cas4 family protein 
Protein accessionYP_002514498 
Protein GI220935599 
COG category[L] Replication, recombination and repair 
COG ID[COG1468] RecB family exonuclease
[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR00372] CRISPR-associated protein Cas4 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0199418 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGATC AGCAGCAGCC CGAACTGCCG CTCCCTTTTC CGGAGCTATC CGGCGATATG 
CCCTTGCTCC CGGCGCGCAT GGTCAACGAA TACCAGTACT GCCCCCGTCT GGCGTATCTG
GAATGGGTGC AGGGGGAGTG GTCCGAGTCC GCGGATACCG TGGATGGACG CTACCGTCAC
CGTCGGGTGG ACAAAGGTTC TGGAGATCTT CCGGCCCCCG GCGTCGCCGA GGCGGGTGAA
CGCTACCATG CCCGCTCCAT AACCCTGTCC TCCAACCGCC TGGGCCTCAT CGCCCGCATG
GATCTGCTGG AAGGCGAGGG TGACCATGTC ACACCCGTCG ACTACAAGCG TGGCAAGCGG
CCTCATGTGG CCCGCGGCGC CTACGAGCCC GAAAGGGTCC AGTTGTGCGT GCAGGGCATG
ATTCTGGAGG AATGCGGCTA TACCTGCGAC GAAGGGGAGT TGTACTTCAC CGAATCCCGC
GAGCGTGTAC GTGTGCCCTT CGACGAGGAA TTACGCCGGT TGACGCTCAA CGCCATCAAC
GGACTACGAT TCATCGCCGC GGGTGGACAG ATCCCCGCAC CCCTGGAGGA CAGTCCCAAG
TGTCCGCGCT GTTCGCTGGT AGGTATCTGC CTGCCCGACG AGGTCAACTA CCTCCGCCGG
GAACAGACGC CGCCAAGGCC CCTGGCCGTC GCCCGGGATG AGGCCCTGCC ACTCTATATC
CAGGCCCGGG GCGCCAAGCT GGCCAAGCGG GGCGAGACTT TGGAGGTCAC GGTGGATGAC
GAAAAAGTAC AGTCGGTGCG CCTCATCGAC GTCTCCCAGG TGATCGTCAT GGGCAATGTC
TATATCACCA CGCCGTGCCT TCAGGAACTC ATGCAGCGCG AGATCCCCGT CAGTTGGCAT
TCCCATGGCG GCTGGTTCAT GGGCCACACC ATGGGCACCG GCCACAAGAA CGTGGAGATC
CGGACTGCCC AGTACAAGGC GAGCTTCGAG GAGCACCAGT GTCTGCACAT CGCCAAGGGC
CTGGTGGAGG CCAAGATCCA GAATTGCCGC ACGCTGTTGC GGCGCAACTG GAAAGGCGAG
GACAAGCCCG TGGATCTGCT CGACGGCCTG CAGGTGGATA TCAGGAAATC CCGTCGTGCA
TCCAATCTTC AGGAGCTGCT TGGCATCGAA GGGGCGGCCG CCTCCCGCTA CTTCGGTGCC
TTCGCCCGGC TGTTGAAGCA CAGCGATGCC GGGCCCGAAC TGACATTCGA CTTCACCACG
CGCAACCGCC GTCCGCCCAC GGACCCGGTC AACGCCCTGT TGTCCTACGC CTACGCCCTT
CTGACCCGGT CCTGGACGGC TTCGCTCTCG GCAGTGGGCC TTGATCCGTA TCGAGGCTTC
TATCATCAGC CCCGTTACGG CCGTCCGGCG CTGGCATTGG ACATGATGGA GCCGTTCAGA
CCCTTGATCG CGGATTCCAG TGTCATTCAG GCGATCAACA ACGGTGAAGT GCGTCCATCG
GATTTCCAGA GCGTGGCCGG CAGCGTCGCG CTGACCAACG ACGGACGCAA GCGCTTCATC
GCCACCTTCG AGCGGCGCAT GAGTCATGAG ATCACCCATC CCCTGTTCGG ATACCGGCTC
AGCTACCGCC GGCTGCTGGA GGTCCAGGGC AGGCTGCTCG CGCGCTATCT GTTGGGTGAG
CTGCCCGACT ATCCGAACTT CACGACCCGG TGA
 
Protein sequence
MDDQQQPELP LPFPELSGDM PLLPARMVNE YQYCPRLAYL EWVQGEWSES ADTVDGRYRH 
RRVDKGSGDL PAPGVAEAGE RYHARSITLS SNRLGLIARM DLLEGEGDHV TPVDYKRGKR
PHVARGAYEP ERVQLCVQGM ILEECGYTCD EGELYFTESR ERVRVPFDEE LRRLTLNAIN
GLRFIAAGGQ IPAPLEDSPK CPRCSLVGIC LPDEVNYLRR EQTPPRPLAV ARDEALPLYI
QARGAKLAKR GETLEVTVDD EKVQSVRLID VSQVIVMGNV YITTPCLQEL MQREIPVSWH
SHGGWFMGHT MGTGHKNVEI RTAQYKASFE EHQCLHIAKG LVEAKIQNCR TLLRRNWKGE
DKPVDLLDGL QVDIRKSRRA SNLQELLGIE GAAASRYFGA FARLLKHSDA GPELTFDFTT
RNRRPPTDPV NALLSYAYAL LTRSWTASLS AVGLDPYRGF YHQPRYGRPA LALDMMEPFR
PLIADSSVIQ AINNGEVRPS DFQSVAGSVA LTNDGRKRFI ATFERRMSHE ITHPLFGYRL
SYRRLLEVQG RLLARYLLGE LPDYPNFTTR