Gene Sare_4824 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4824 
Symbol 
ID5707945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5453698 
End bp5457051 
Gene Length3354 bp 
Protein Length1117 aa 
Translation table11 
GC content70% 
IMG OID641274220 
Productcoagulation factor 5/8 type domain-containing protein 
Protein accessionYP_001539565 
Protein GI159040312 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.90345 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACGGTA GCGGCAACAC AGTGCAACCG GCGACCTCGG GGACGTCCGC GCTGACGTGG 
TATTCCGAAC GTGCCGTGGT GGTCGGCGAA CCGTCGGCAT TCGCGGTAGT GGCCCGATCG
CTGCCGGTGC CGCCAGGTGA GGTCGCGGTG CTGGGACAGC CGAGCGGGTC GGTCGACTGG
CGGGCCGTGG CTGATGCACT GTTGGCCGTC GTGTCCTCAG GGGCGGTGGT ACGGCTGGCG
GTGCCGGGAG TTGGTAGCCC GGACACCTCG GGGCAGGTGC CGGCGGCGGT TCTGGCCGAG
CACGCGGGAT TCGAGGTTCT CGCGCCGGCC GGACGCCTGG AGCCGATTCC CGGCGGCACG
TTGTTCGCGG ACGAGGGCTG GCGGCGGTTC GGGGCAGGAT CGTCGTCGTC CGCCGGCGGC
GGCCAGCACG ATTCGGCTGT GGGCCGGCGG TTTCCCGCGC CCCGCTGGCA ACCTGCCGTC
GACGCCACGG CCGATGGCGG ATCAGTGTCC GGCCTGCTCC ATACGCCGAT CCCGGCGGGC
CTGTGGGTGT ACCCCGACGC GCCCGACCAG CCTGCTCCGG GTCGGGACGA CCCCGCGTAC
GCCATACCGG TGGATGCGGA CCGTCCGGTG TTGCTGGTGG GACGGCCGGG CGTTGCGGCA
CCTGACCCAG ATGCGGTCGT GGCAGTGGCG GACGCGCTGC CGAGTACCCT GCGCGGACGC
CTGACCGTGG TGCCCTATGG CCCGGGAGCG ACGGTCAGCG CGCAGGTGTG CCAGATGCTC
GCCCGGCGAT GGGCGTGCAC CGTGACCATG GACACCGGGG TACCGACACT GCACCACGAC
AATCGTGTGG TGCCGGTCGT AGTCGATTCG GAGGACGTCG GTGGGTGGGT GCCGCCGATA
TCGCAGCTCG CCTTCGCGCC ATCCGGCCAG CCGAGTCCGG CCGGGTCGGT CGACTTTCTG
GTCGGGCTGG CCCGGGTCGA ACCGGACGCC TACCAACTCA CCGCACAGTG GGTGGTCGAG
GTCGTCCAAA GCGGGGTGTG GCTTCGTCCA CCGCTGTCGG GAGAGTCGGC CCCAGCGGTG
CGTGAGCGGC CCTGGCGGTC CGGGCGGATG GCGATCGTCG TTGGGGTGCC GGGCGCACCG
GTGGACGACG AGGCCCGGCG GGCGCTGCGG GAGCTGCTGG TCCGGCTGCC TGCTCCGGTA
CGAGGACGGG TCGACCTGTA TCCCCGGGAG GCCGAGTCCC TCGCGCGTGA GTACACACCG
CCGACCGACT CACCCGACAT CGCGCTCGTC CCCGCGCGGA CCGAGCCGCC GATGTGGTGG
CAAGGCGACG AACGGTTGTT CAGCGTCTTG GTCGAGATCG ATGACGACGG CCAGGTGCGC
ACCGAGGAGG GTGCGCTCGG ACCCGGTGAG CTCGGTGAAC TGATCACGGC CCATAGTCAG
CGAGCCGGCC GACCGATCCT GCTCGTGTCG AATACCGACG TGCCCGTGGA GTTCTGCCAG
CGCGTCGCCG ACCAGGTCCC GGCGATCGTG ATCACCGGCA GTCCGCACGC GGGCGGGTGG
TCCACCATCC GCCCGCGTCA GGTAGGTCAC GACGAGTTGC CACCGTCGCG TCTGGAGAGC
CCGTATCCGT TGGCCGACCA CGATCTGGAC GACATGCTGA ATGAGCAGGC GGTTCCGGCC
GCGGAGGTGA CATGGCAGCC GGCGTTCGCG AACCGGCGGA GGCTGGACCT GCGAACAAAA
CGGTCTGTGG CGGATACGGG TGACGGAACC ACGGTGCCGG GCGAGACGGA CGCACCCGAT
CGCGTCGCAC TATTGGGCCG TGGTCCCGGC GCTCGCGCGC CGTTTCGGGT GTTCGGCCGG
CTGGCGAGTG AGTGGCACGT GGCCCTGACC ATAGCCACGC AGCGGCCGGA GTGGATTCAC
TGTGACCACC TGATCACGGT CGAGATCGAC GCGGTTGATC CGCTGAGTTT GCAGCTACTG
GCGAACTACC TGGATGCGCC CTTGGCCGCG CGGCGACAGC GAGTTGACGG GCGTCCGGCA
TCCAGCGACC AGCCTGGGTG GATACTGGCT CGACCTCGTC GGGTGGTGGG GCCCACCACA
AGCGGCTTCC GGATGCCTGG TCGAGGTGAG GCGATGGACG GAACGCCGAC ACCACGGCGG
GCGATCGCGC TACCTGATAC GGCCCACCTG TCCACCTTCC CTGTCCCGGC CGAGCAGGCC
GCCGTTACTG CACCGGATCC GACCCCTGCG GAGCCGGATC CGGTGCCCGA CAGCACGACA
GCACCGCCGG CGCCGCCATC GACGCCAACG CGAACGCCGG TGGCGGCTTC CACTACCCCG
GACCCAGGCC CACACACCGT GGAACCCGAA GCCTCGACGA GTGGTGGGAG CGCAATCTCG
GGTGCACCGG CCTCCGGCGC GTTGCCGGCC CTGACGGCGC CGGTCGAGGA AGCCGTGCCA
CTGCTGGCCC AGACAGGTGG CGGCGCAATC TCGGGTGGAC AGGCCTCCAG TGCACTGCCA
GACCTGACAG CGCCGGTCGA CAAAGGCGTG CCGCTGCCCG TGCTCGCGTC CACGTCGTCA
CCCGCCCGCG CTGACGCGCG GGGCCCCCGC CGTCGTCGCA GGCTGGCGGG AGCGGTCCTG
GTCGCGGCCA CCGTCGCGGC CACTGGCGGT GCTGTAGTCG CGCTGCGCGA TGCTGTGGAC
GCGGGTGCCG CCGACGTGCG GAGGGCTGCG GAGGTGACGA CGTCATCGGA ACCGGCGCCC
ACCGCCGGAG GCGCGGCGCC GCAGCTCACG GTGAGTCCGA GCGCACCCCC GACCGCGTCC
GAGCACCCCA GCCGGACACC CGCGCCCGCT GCGTCGATGT CTCCGGTGAC GAGGCCAACA
CGAGCGGGCG CCGAGCAGAC GCCACCTGCT GCCGCCGGTC GACCGACCTC GACGTCGACC
ACGGTTACGG GTCGGCCGAA CACCACGAAA CGTAACCTGG CCCTCGGTCG TGTGGCGGCC
GCCTCCAGCA CGGAGACGTC CACGTTGGCG GTCGAGAACG TGGTCGACGG TGACCGGACG
ACCCGTTGGT CCAGTGAATG GAATGACGAT CCGCAGTGGT TGGGAATCGA CCTTGGTTCC
GTCTGGGCTG TCACCGAGGT CCGGCTGATC TGGGAGGAGG CCTACGCGAC CAGGTACCGG
GTCGAGCTTT CACCTGACGG AGCGCACTGG ACGGCCGTCT ATTCCACAGG TAACGGAGCC
GGTGGCACCG TCAGGATCAG CGTGACGTCG GCGGCTGCGC GATACGTCCG GCTCGTGTTC
GACCAACGTG GCACCATGTG GGGCTACTCG TTGTGGGAGC TCGACGTTCG TTGA
 
Protein sequence
MDGSGNTVQP ATSGTSALTW YSERAVVVGE PSAFAVVARS LPVPPGEVAV LGQPSGSVDW 
RAVADALLAV VSSGAVVRLA VPGVGSPDTS GQVPAAVLAE HAGFEVLAPA GRLEPIPGGT
LFADEGWRRF GAGSSSSAGG GQHDSAVGRR FPAPRWQPAV DATADGGSVS GLLHTPIPAG
LWVYPDAPDQ PAPGRDDPAY AIPVDADRPV LLVGRPGVAA PDPDAVVAVA DALPSTLRGR
LTVVPYGPGA TVSAQVCQML ARRWACTVTM DTGVPTLHHD NRVVPVVVDS EDVGGWVPPI
SQLAFAPSGQ PSPAGSVDFL VGLARVEPDA YQLTAQWVVE VVQSGVWLRP PLSGESAPAV
RERPWRSGRM AIVVGVPGAP VDDEARRALR ELLVRLPAPV RGRVDLYPRE AESLAREYTP
PTDSPDIALV PARTEPPMWW QGDERLFSVL VEIDDDGQVR TEEGALGPGE LGELITAHSQ
RAGRPILLVS NTDVPVEFCQ RVADQVPAIV ITGSPHAGGW STIRPRQVGH DELPPSRLES
PYPLADHDLD DMLNEQAVPA AEVTWQPAFA NRRRLDLRTK RSVADTGDGT TVPGETDAPD
RVALLGRGPG ARAPFRVFGR LASEWHVALT IATQRPEWIH CDHLITVEID AVDPLSLQLL
ANYLDAPLAA RRQRVDGRPA SSDQPGWILA RPRRVVGPTT SGFRMPGRGE AMDGTPTPRR
AIALPDTAHL STFPVPAEQA AVTAPDPTPA EPDPVPDSTT APPAPPSTPT RTPVAASTTP
DPGPHTVEPE ASTSGGSAIS GAPASGALPA LTAPVEEAVP LLAQTGGGAI SGGQASSALP
DLTAPVDKGV PLPVLASTSS PARADARGPR RRRRLAGAVL VAATVAATGG AVVALRDAVD
AGAADVRRAA EVTTSSEPAP TAGGAAPQLT VSPSAPPTAS EHPSRTPAPA ASMSPVTRPT
RAGAEQTPPA AAGRPTSTST TVTGRPNTTK RNLALGRVAA ASSTETSTLA VENVVDGDRT
TRWSSEWNDD PQWLGIDLGS VWAVTEVRLI WEEAYATRYR VELSPDGAHW TAVYSTGNGA
GGTVRISVTS AAARYVRLVF DQRGTMWGYS LWELDVR