Gene BLD_1960 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBLD_1960 
SymbolhsdS3 
ID6363795 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBifidobacterium longum DJO10A 
KingdomBacteria 
Replicon accessionNC_010816 
Strand
Start bp2338749 
End bp2339990 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content46% 
IMG OID642681169 
Productrestriction endonuclease S subunit 
Protein accessionYP_001955903 
Protein GI189440822 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.323459 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGAAC AGGCGAAAGT TCCTGCGATT CGCTTTGCTG GTTTTACTGA CCCTTGGGAA 
CAGCGTAAGT TGGGCGAGAT TGCCGATAAG GTGACAGAAA AAAATCTTGA TGGAAACATC
ACCGAGGTTC TTACCAACTC TGCAGAATAC GGTGTAATTA ATCAAACCGA GTTCTTCGAC
CACGCTGTCG CCAAGGAATC CAACATTGCT GGTTATTATG TCATTGCTCC AGGGGATTTC
GTGTACAACC CTCGCATCTC CGCAACAGCG CCTGTTGGCC CAATCCGTAG GAATACGTTG
GGAATACACG GAGTTATGTC TCCTCTCTAC ACTGTATTCA GGCTTACAGA TGCAGTCGAT
GGAACTTATC TCAGCCACTT CTTCAAGACA AATGGCTGGC ATGGTTTCAT GAAGCTGGAA
GGTAATTCGG GAGCCAGATC AGATAGGTTC TCAATCGGTG ATGCGACATT CTTTGAAATG
CCAATCCCAG TTCCATCTTC AAGTGAACAA TATGCTATAG GCTCCTTCTT TTCCCGTCTC
GACGACCTCA TCACCCTTCA TCAGCGTAAG TATGACAAGC TCGTCATCTT CAAAAAATCG
ATGCTTGAAA AAATGTTCCC GAAGGATGGC GAATCTGTAC CCGAAATTCG CTTTGCTGGT
TTTACTGACC CTTGGGAACA GCGTAAGTTG GGCGAGATTG CCGATAAGGT GACAGCAAAA
AATCTTGATG GAAACATCAC CGAGGTTCTT ACCAACTCTG CAGAATACGG TGTAATTAAT
CAAACCGAGT TCTTCGACCA CGCTGTCGCC AAGGAATCCA ACATTGCTGG TTATTATGTC
ATTGCTCCAG GGGATTTCGT GTACAACCCT CGCATCTCCG CAACAGCGCC TGTTGGCCCA
ATCCGTAGGA ATACGTTGGG AATACACGGA GTTATGTCTC CTCTCTACAC TGTATTCAGG
CTTACAGATG CAGTCGATGG AACTTATCTC AGCCACTTCT TCAAGACAAA TGGCTGGCAT
GGTTTCATGA AGCTGGAAGG TAATTCGGGA GCCAGATCAG ATAGGTTCTC AATCGGTGAT
GCGACATTCT TTGAAATGCC AATCCCAGTT CCATCTTCAA GTGAACAACA TGCTATAGGC
TCCTTCTTTT CCCGTCTTGA CAACCTCATC ACTCTTCATC AGCGTAAGTT GGAATTGCTG
CAGGATATCA AGAAATCTTT GCTTGACAAG ATGTTTGTGT GA
 
Protein sequence
MTEQAKVPAI RFAGFTDPWE QRKLGEIADK VTEKNLDGNI TEVLTNSAEY GVINQTEFFD 
HAVAKESNIA GYYVIAPGDF VYNPRISATA PVGPIRRNTL GIHGVMSPLY TVFRLTDAVD
GTYLSHFFKT NGWHGFMKLE GNSGARSDRF SIGDATFFEM PIPVPSSSEQ YAIGSFFSRL
DDLITLHQRK YDKLVIFKKS MLEKMFPKDG ESVPEIRFAG FTDPWEQRKL GEIADKVTAK
NLDGNITEVL TNSAEYGVIN QTEFFDHAVA KESNIAGYYV IAPGDFVYNP RISATAPVGP
IRRNTLGIHG VMSPLYTVFR LTDAVDGTYL SHFFKTNGWH GFMKLEGNSG ARSDRFSIGD
ATFFEMPIPV PSSSEQHAIG SFFSRLDNLI TLHQRKLELL QDIKKSLLDK MFV