Gene Dred_1007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDred_1007 
Symbol 
ID4956261 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum reducens MI-1 
KingdomBacteria 
Replicon accessionNC_009253 
Strand
Start bp1078392 
End bp1079423 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content40% 
IMG OID640180177 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_001112367 
Protein GI134298871 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAAA TGTTGAATGT TCTTTATATA ACAAATCCTG AGGCTTATTT GGCAAAAGAT 
GGGGAAAACC TTGTGGTAAG AGTTCAGGAT GAAGAAATTT TCAGAACTCC TATTCACTAT
CTGGAGGGTA TAGTTACCTT TGGTTATATG GGTGCAAGCC CCGCTTTACT GGGAATGTGT
GTTGAGAAAG GGGTTACGGT ATCTTTTTTA ACTGCCCATG GTAAGCATCA GGCTACAGTA
CATGGAACTC CCAAGGGCAA TGTGCTATTG AGAAGAAAAC AGTACCGTCT GGCTGATTCT
GAAAGTGAAT CTGCAAAGCT GGCTTCAATG TTTATTATCG GGAAAATTGC TAACTGCAGG
ACGGTACTCC GCCGTTTTAT GAGTGATTAC GGAGATAAGG TTCAAATTGA AGAAGTTGAT
CGTGTCTCTA AAATAATGGC TCGTAATGTA TTGAGGCTAG GCAAAGAACT ATTGCTTGAT
GAAGTGAGAG GAATTGAAGG CGAATCAGCA CAAATGTATT TTTCTGTATT TGACCAACTA
ATTATATGTC ATAAGGACCA CTTTTTTATG AAAGGAAGAA ATCGCAGACC GCCATTGGAC
AACATGAACG CATTGTTGTC ATTCCTATAT AGTCTACTTT TGCATGAGAC CCGATCCGCT
CTAGAAACTG TTGGATTAGA TCCATATGTT GGTTTTTTAC ACCGTGACCG ACCGGGTAGG
ACCGGGTTGG CCCTAGACCT TATGGAAGAA TTTCGGCCGT ACTTGGTAGA CAGATTAGCA
TTAAGTTTAA TTAATAGACG GCAAGTCACA GGAGATGGAT TTTTGAAAAA GGAATCGGGT
GGAGTTATTA TGAAGGAGAA TGTTCGAAAA ATCGTGATAG AAGCCTGGCA AAAAAGAAAG
AGAGAAGAAA TAACCCATCC ATTTCTGGAA AAAAAGATAT ATGTTGGTTT ATTGCCCTAT
GCGCAAGCCC TATTGTTAGC TAGACATTTA AGGGGAGACT TAGATCGATA TCCTCCGTTT
GTATGGAAGT AA
 
Protein sequence
MRKMLNVLYI TNPEAYLAKD GENLVVRVQD EEIFRTPIHY LEGIVTFGYM GASPALLGMC 
VEKGVTVSFL TAHGKHQATV HGTPKGNVLL RRKQYRLADS ESESAKLASM FIIGKIANCR
TVLRRFMSDY GDKVQIEEVD RVSKIMARNV LRLGKELLLD EVRGIEGESA QMYFSVFDQL
IICHKDHFFM KGRNRRPPLD NMNALLSFLY SLLLHETRSA LETVGLDPYV GFLHRDRPGR
TGLALDLMEE FRPYLVDRLA LSLINRRQVT GDGFLKKESG GVIMKENVRK IVIEAWQKRK
REEITHPFLE KKIYVGLLPY AQALLLARHL RGDLDRYPPF VWK