Gene Sare_3114 details


Gene Information

Locus tag: Sare_3114
Symbol:
ID: 5706554
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3541012
End bp: 3543366
Gene Length: 2355 bp
Protein Length: 784 aa
Translation table: 11
GC content: 75%
IMG OID: 641272546
Product: transglutaminase domain-containing protein
Protein accession: YP_001537913
Protein GI: 159038660
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
TIGRFAM ID:
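
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start/end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3541012, 3543366)
print(length)                     # 2355
print(protein_length_aa(length))  # 784
```

Both results agree with the Gene Length and Protein Length values listed in the record.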


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.126034
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0170569
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTGAGGA TGGCCGACCG GCCGGCGACC CAGCGACCGG TGCCGCCGTC CGGCTCTGCC 
TCCGGAGGCG TCGGCAAGGG GTCTCTCTGG TCGGAGCTGC CGCTGGCGGT GGTGTTGGTC
GTGCTGGTGG GGCTGGCCGG GGTCGTCCTC GGGCGGATGT ACGCCGGGAG CCTGCTCACC
GGTCTGGTTG TCGGTGCCGC CATCGGCTCG GTGCTACTCG GAGTGGCCTG CCGGCGGCTG
CCGTCGTGGC TGGTGGCGCC GCTGTCGGTG CTCGGCCTGG CGACCTGGAC AGTGTGGTCG
CTGCGGCTGG CTGCGGATCG TGCCGACCTG CCCGGTCCGC TCACCGAGGT GGCCGGCGAG
GCCGCGCGCA ACGGTATCCC CCGGCTGTTG ACCGCGATGA TCCCGGTCGA GCCGACGCCG
GACACGGTTC TCGTTCCGGT GGTCGCTGCC TGGCTGGCGG GGCTGGCTGC CGGCGAGGTG
GCGCTGCGGG CGGGTCGGGT GTTGCTCGGC TATCTGCCGC CGGTGCTGCT CTACGCGGGT
GCCGTCTACC TGGTCGGGCC GAACGCCGCG CCGGCGGTAC GGCCGACGCT GGTGTTCGTG
GCGGTGGCGG CGCTTGGGCT CGCCCTGCCG GGACGCGTCC GGCCGGGCGG TGCGGACCAC
ATGGCGGGGC TGCCGCCGAA GGTTCGCGCG GCGGCGCGGA TCCGGCTCGC CGGGGCGTCG
ACGGCCGGCG TGGCGGTGGT GGTGGCGCTG GCTGTCCTGC TCGGTCCGGC GCTCGCCGCT
CGGATCGGGG ATCGGCCGGT TGATCCGCGC CGGTATGTGG AGCCGCCCCG GGTGGACTCG
CTGGACGAGA ACCCGCTGAT CCGGATATCG GGCTGGGCGT TGCACCCGGA GCAGAAGCTG
CTGGACGTGA CGAGCCGGTC CGGCGGCGAC GCGGACCAGG GTGGGGACGC GACGCGGATC
CGGTTGGCAG TTCTCAGTGA CTACGACGGG GTCACCTGGC AGGTCGGCGC GACGTACCGC
AACGCTGGCC GGATCCTCCC GGCGCCGGCC GTCGCCCCCG GCAGCATTGT CGAGCCGGTA
CGCCAGGACA TCACCGTCGC TGGGCTGACC GGGCGGTTGT TACCCGCCGT GGCCACCCCA
CGGGAGGTTA CCGGGGCACG GGTGGCGTAC GACCCGGCGA CCGGCACGTT GATCCACCCG
GAGGGGCTGA CTCCGGGGTT GCGGTACACG GTGACGTCAG CCCAGGAGCA GCCGGAGTCC
AACCTGCTGG CCATCGCGGA CGTGCCGGCC GGCGAGGAGG TGGCTCGGGT GCTGCGGGTG
GCCGACGGGG TGCCGGAGCC ACTGCGCCGG TTGGCGTCCC AGCTCGCCGA GTCCGGTGGC
GCCCCGTACA CGCGGGCTGC CGCTGTGGAG CAGTTCCTCG CCGAGCACTA CCGTCTGGTC
GCGGACGCGC CGAGTGGGCA CGCGTACCCG AATCTGGCGT TCTTCCTGTT CGGCCCACGC
GACGCGGGCG GGCAGCGGGG CACGTCGGAG CAGTTCGCGG CCGCCTTCGC GGTGCTGGGT
CGGCTGACGG GGCTACCGAC CCGGGTGGTG GTCGGCTTCC GGCCGTCAGG CGACGGGCCG
GTGCGGGCCG GGGACGCGTA CGCCTGGCCG GAGGTGTTGT TCGACGAGCT GGGCTGGGTA
CCGTTCGATC CGCTGCCCCG ACCGGACACC GAGCCGCGGC CGGTCGAGGA GGGTTTACGT
CCCCCGCCGG AGGAGCCGCC GTCACCGGCT CCGGAGCCGA GCGCCGATCC GACGGTCTCC
CCGCAGGCGT CGGGTGCGCC CGAGACCGCA GCGGGTCCAG GTGGCAGGTC AACGCCGGTG
CTGCTGGTGG GCGGTGCTGC TGCGGGAGGC GGACTTGTCC TGCTGGCGTT GCTGGTGACG
CTGCTGTGGC TACGCCGCGC ACTGAGCCGG GGCCGGCTGG ACCGGGGCCC TCCCGGCGAA
CGGGTGGGCG GTGCCTGGCG GGAGGTGACC GACGCGCTGC GACTCGCCGG GCACCCGGCC
ACCGGCGATC TGGCGGCCAC CGAGATCGCC GTGTTGGCTC GCGACGTGCT GGTGGACGCA
CACGGGGCCG GGGCTGAACC GGCGGCGGTG GAGGTCGCGG AGTTGGCGGA GTTGGCGGAC
CTGCTCAACC GGGCGGCGTT CGCACCGGGC ACGCTCACCG CGCAGCAGGC GAATCGGGCA
GCCACGATCG CCCAAGCGTA CGTGGCGGGG CTGCGGGCGA CCCGGCCGTG GTGGCGCCGA
CTGCTGTGGT CGGTCCACCC TGGCCCGTTG CGCCGTGACC GGGCGGATCG GCGCCGGCGG
GACGCGCGGG TGTGA
 
Protein sequence
MVRMADRPAT QRPVPPSGSA SGGVGKGSLW SELPLAVVLV VLVGLAGVVL GRMYAGSLLT 
GLVVGAAIGS VLLGVACRRL PSWLVAPLSV LGLATWTVWS LRLAADRADL PGPLTEVAGE
AARNGIPRLL TAMIPVEPTP DTVLVPVVAA WLAGLAAGEV ALRAGRVLLG YLPPVLLYAG
AVYLVGPNAA PAVRPTLVFV AVAALGLALP GRVRPGGADH MAGLPPKVRA AARIRLAGAS
TAGVAVVVAL AVLLGPALAA RIGDRPVDPR RYVEPPRVDS LDENPLIRIS GWALHPEQKL
LDVTSRSGGD ADQGGDATRI RLAVLSDYDG VTWQVGATYR NAGRILPAPA VAPGSIVEPV
RQDITVAGLT GRLLPAVATP REVTGARVAY DPATGTLIHP EGLTPGLRYT VTSAQEQPES
NLLAIADVPA GEEVARVLRV ADGVPEPLRR LASQLAESGG APYTRAAAVE QFLAEHYRLV
ADAPSGHAYP NLAFFLFGPR DAGGQRGTSE QFAAAFAVLG RLTGLPTRVV VGFRPSGDGP
VRAGDAYAWP EVLFDELGWV PFDPLPRPDT EPRPVEEGLR PPPEEPPSPA PEPSADPTVS
PQASGAPETA AGPGGRSTPV LLVGGAAAGG GLVLLALLVT LLWLRRALSR GRLDRGPPGE
RVGGAWREVT DALRLAGHPA TGDLAATEIA VLARDVLVDA HGAGAEPAAV EVAELAELAD
LLNRAAFAPG TLTAQQANRA ATIAQAYVAG LRATRPWWRR LLWSVHPGPL RRDRADRRRR
DARV
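
The relationship between the gene sequence and the protein sequence can be sketched in plain Python. This is an illustrative sketch, not the pipeline that produced the record: it uses the standard codon assignments (which translation table 11 shares for sense codons), and it assumes that only ATG, GTG, and TTG are treated as start codons read as methionine in the first position. All function and constant names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard sense-codon assignments).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)
# Assumption: a simplified set of bacterial start codons.
ASSUMED_START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean_sequence(text)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(text: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(text)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        if i == 0 and codon in ASSUMED_START_CODONS:
            residue = "M"  # an alternative start codon still initiates with Met
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate_cds("GTGGTGAGGA TGGCCGACCG GCCGGCGACC"))  # MVRMADRPAT
```

The output matches the first ten residues of the protein sequence above. Passing the full gene sequence to `gc_content` gives a way to compare against the GC content field.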