Gene Sare_1360 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1360 
Symbol 
ID5705574 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1568335 
End bp1572078 
Gene Length3744 bp 
Protein Length1247 aa 
Translation table11 
GC content72% 
IMG OID641270871 
Productfibronectin type III domain-containing protein 
Protein accessionYP_001536252 
Protein GI159036999 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.858944 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000129137 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGACCTGT TCCGCAGACT CCGGATCCCC TGGGCCGAGC GACCCGCCAC CAACCACATG 
GATGCCACCG ATGACGACAG GCCGCTGGTG CCGGCGACAC GGAGCGCGCA GGACACCGCG
ACGAGCCCGC CATCGGAGGC CCCGAACCCG CCACCACCGA CACTCCTTGA CCCGATGGCA
CCACCAACGG CCCTGGACCC GGCGGCACCA CCAACGGTCC TGGACCCGAT GGCACCACCG
ACGGCCCTGG ACCCGGTGGC GCTGCCCCGC TGTTTCGGCC CGGTGCCGAA CCACGCGCAC
AGCCCGCTGC CGGCCCTGAA CGCCGACGGC GAGGTGATTC CCGGCACCGG CATCCGAAAG
TTCGTCGACC CGCTGCCCAC GCTCGACTCG GGCTACCACC CAGCCGGCGG CCTGCCGGTG
GCGGTCCCGG ACACCATCAC CTACCCGGGC TGCGACTACT ACGAAATCGG TCTCCAGGAG
TACGCGCAGC GGCTGCACCG GGACCTTCCG GCAACCCGGC TGCGCGGGTA CCGGCAGCTC
AACCTCGGCA CCGACCCGAC CGGGCACAAC ACGGTCGCAC CACCGGAGCA CCCGGTTCAC
CTCGGCCCGG TGATCGTCGC GCGGCGCGGC CGGCCGGTGC GGGTCAAGTT CATCAACCAG
CTTCCCACCG GGCAGGCCGG TGAGCTGTTC CTACCCGTCG ATCTGAGCAT CGACGGCGCC
GGCCGGGGAC CGCTCGACGG ACCGGCACCG TACCCGCAGA ACCGGGCGGT GCTGCACCTC
GCGGGCGCGC AGACCGGTTG GATCAGCGCC GGAACCCCAT GGCAGTGGAT CACACCCGCC
GCAGAGGTCA CCCCGTACCC GACCGGTCCT GGCCTGGCCC ACGTGCCGGA CATGCCGCCG
CCCGGCATCG GGGCCACCAC GCTGTACTAC CCGAACGAAC AGAGCGGGCG GCTCCTCGCG
TTCCACGACA ACACCGCCGG ACTCGCCCGA CTCACAATCT ACTCTGGTCA GCTCGGCCTT
TACCTGCTCA CCGATCCCGC CGAAGCTGGC CTGGTCGCCG ACGGCGTGCT GCCCGCCGAC
CAGATCCCCC TGGTCTTCGA GGACAAGACG TTCGTGCCCG ACGACAGCCA GCTGGCCGAC
ACCGACCCGA CCTGGGACCG GGACCGGTGG GGCGCCCGAG GCAGCCTCTG GCACCCGCAC
GTCTACCAAC CCCGGCAGAA CCCGTACCGA GACGACGGCA TCAACCCGAC CGGACGCTGG
GACTACGGAC CGTGGACGCG CACCGCCGAG GAGATGGGGG TGAACGGCAC GACCGGGGCG
GACGGCATCG GAGCGGGCCC GGTGCCGAAC CCACACCACG ACCCGGTCGG TGACACGCCG
GATGAGCCAC CGATGACACC AGGCGTACCG CACCCGTCGG CGGTACCCGC CGCATACGGC
GACACCGTGC TGGTCAACGG GGTGGCGTAC CCGTGGCTGA CGGTCGAACC GCGCACCTAT
CGGTTCCGCG TCCTCAACGC CTGCCTGGAC CGCAGCCTGA ACCTTCAGCT CTATCGGGCG
CGCGGCACCG GACCGATGTG GGACCGCAAC GGCGAACTGG CCGACCCGGA CGCCGGCGAG
GTTCCGATGG TGGCGGCGAC ACGGACACCA GACCGACCCG CCCGATGGCC GACAGACGGA
CGGGAAGGCG GAGTGCCGGA CCCGCTCGCC GCCGGCCCCG ACCTGGTGCA GATCGGCAAC
GAAGGAGGCC TCCTGCCGGC CCCGGCGGTA CTCCCCAGCC AACCGGTGAC CTACCGACAC
GACCGGCGCG ACCCCACCGT CCTCAACATC GACCGGCACG CGCTGCTGCT CGCCCCCGGA
GAACAGGCCG ACGTGCTGGT CGACTTCGCC ACCGTCGCCC CCGGCAGCAC CGTGCTCCTC
TACAACGACG CCCCGGCCCC GCTCCCCGAC TTCGACCCCC GGTACGACCA CCACACCGAA
GCCCCGGACC ACACCGCCGA AGGTGGGACA CCACCGAGCC GACCCGGGTA CGGGCCGAAC
ACCCGCACCC TGCTGCAGTT CCGGGTGGCC GGCACCCCGG CGCCCCACTA CGACCTGGCC
CGGCTCCGGG AACGGCTACC CCGGGCGTAC GCGGCCGGTC AGCTTCCGCC GATCGTGCCG
CAACCCGCCT ACGACGCGGC GTTCGGCACC CGCACCGCCC GCGAAACCAC AGTCCCGGTG
CACGCCACCA CAGTCGGTTT CACCCCGGCC GGGGGTACCG ATCCCGTCCT GCTGCCGCTC
GCCGTGAAGG CCGTCCAGCA GGTCTTCGAA CCGGCACACG GCCGGCTCGC CGGCCGGCTC
GGCGTCGGGC CCCCACCCGG TGGCCCGCTC GCCCCCGCCG TGGTGTCGCT CAACCCCACC
GACCCGGCCA CCGAGATGGT GCAGGTCGGT GAGCCGACGG TCCCGATGGG CCCGGCCGTC
GACGGCACAC AGCTGTGGCG GATCCGTGGC GCCAGCCGGC AGACCCAGCC GATCCGCCTC
GACGGTGTCG ACCTGCAACT GATCAACCGG GCAGGGTGGG ACGGCACGCT CCGCCCACCG
GACCCGAACG AGGTGGGCTG GAAACAGGTC ATCCGGGTCA ACCCGCGCGA GGACGTGGTG
GTCGCGCTGC ACCCGGTCGC GCCGCCCCTC CCATTCAAGA TCGCCGACAG CGTACGGCCG
CTGGACCCGA GCCGGCCGGG CGACGTCCAC ACCGGAGGAT TCTCGCTGAC CTGTCGCGCC
GCCCCGGCGG TCAACCAGCC CGTCAACCTG GGCTGGGAAT ACCGCTGGCA CAGCCAGCTC
GCCGGATACC GGGACCAGGG CATGTGCCGA CCACTGGTGT TGCGGGTCGC CCCACAGGCT
CCGACCGGGT TGACCGCGAC CCCGGGCCCC GGCTCGGCCA CCGCCCTGCC GGCCATCCTG
CTCGCCTGGA CCGGCAACGG CAACCCGCCC GCCGCCACCA GCCACCTGCT GCAACGGGCC
ACCGACCCGA CGTTCACCGA CGGGCTGACC GCGATCACGG TGGCAGCCAG CGCCACCCAC
TACACCGACG CGACCGTCAC TCCGGGAGTG ACCTACCACT ACCGCATCCG GGCGGAGAAC
GCGGTCAGCT GCTCGGCATG GTCGAACTGT GCGCCCGCGT CGGTGCGGCT CGCCGCACCG
ACCAGCCTGG CCGCGGTGGT GCCGCCGACG GCCCCGCTGC GGGTGGCGCT GCACTGGCGT
AACCGTTCCT TCGCCACCGG CGTCGACGTG CAGCGCGCCA CCAACCCCAC TTTCACCAGC
GGGCCCGGCA CCACGGCCAT CAGCGTGGGC GACACCCACC TCGACCCCGC CGTCGTACCG
AACACCCGCT ACTACTACCG GGTCCGCACC ACCTACCTGG GGGCGGCGTC ACCGTTCTCC
ACTGTGGCGC AGGTGACCAC ACCGCCCCGG CCCGGCACAC CGGAGGCGGT GACCGTCACC
GCGACCGCGT CCGCCCCGGA CACCGCGACC GTCATCCTGG GCTGGGCCGC GAACGCCCCC
GCCGGGCCGG GCGGCGGATT CACCGTGCAG CGGGCCGCGG ACCCGAACTT CACCCGCGAG
GTCGCCACCT TCACCGTCAA CGGGCGGGGG TTCACCAACA CCGGCCTGGC CCGCAGGGCC
ACCTACCACT ACCGGATCCG CGCGTTCAAC GTCGTCGGGA CGTCCCCCTT CACTAACCCG
GTCGCGGTGA CGACGCCGGA CTGA
 
Protein sequence
MDLFRRLRIP WAERPATNHM DATDDDRPLV PATRSAQDTA TSPPSEAPNP PPPTLLDPMA 
PPTALDPAAP PTVLDPMAPP TALDPVALPR CFGPVPNHAH SPLPALNADG EVIPGTGIRK
FVDPLPTLDS GYHPAGGLPV AVPDTITYPG CDYYEIGLQE YAQRLHRDLP ATRLRGYRQL
NLGTDPTGHN TVAPPEHPVH LGPVIVARRG RPVRVKFINQ LPTGQAGELF LPVDLSIDGA
GRGPLDGPAP YPQNRAVLHL AGAQTGWISA GTPWQWITPA AEVTPYPTGP GLAHVPDMPP
PGIGATTLYY PNEQSGRLLA FHDNTAGLAR LTIYSGQLGL YLLTDPAEAG LVADGVLPAD
QIPLVFEDKT FVPDDSQLAD TDPTWDRDRW GARGSLWHPH VYQPRQNPYR DDGINPTGRW
DYGPWTRTAE EMGVNGTTGA DGIGAGPVPN PHHDPVGDTP DEPPMTPGVP HPSAVPAAYG
DTVLVNGVAY PWLTVEPRTY RFRVLNACLD RSLNLQLYRA RGTGPMWDRN GELADPDAGE
VPMVAATRTP DRPARWPTDG REGGVPDPLA AGPDLVQIGN EGGLLPAPAV LPSQPVTYRH
DRRDPTVLNI DRHALLLAPG EQADVLVDFA TVAPGSTVLL YNDAPAPLPD FDPRYDHHTE
APDHTAEGGT PPSRPGYGPN TRTLLQFRVA GTPAPHYDLA RLRERLPRAY AAGQLPPIVP
QPAYDAAFGT RTARETTVPV HATTVGFTPA GGTDPVLLPL AVKAVQQVFE PAHGRLAGRL
GVGPPPGGPL APAVVSLNPT DPATEMVQVG EPTVPMGPAV DGTQLWRIRG ASRQTQPIRL
DGVDLQLINR AGWDGTLRPP DPNEVGWKQV IRVNPREDVV VALHPVAPPL PFKIADSVRP
LDPSRPGDVH TGGFSLTCRA APAVNQPVNL GWEYRWHSQL AGYRDQGMCR PLVLRVAPQA
PTGLTATPGP GSATALPAIL LAWTGNGNPP AATSHLLQRA TDPTFTDGLT AITVAASATH
YTDATVTPGV TYHYRIRAEN AVSCSAWSNC APASVRLAAP TSLAAVVPPT APLRVALHWR
NRSFATGVDV QRATNPTFTS GPGTTAISVG DTHLDPAVVP NTRYYYRVRT TYLGAASPFS
TVAQVTTPPR PGTPEAVTVT ATASAPDTAT VILGWAANAP AGPGGGFTVQ RAADPNFTRE
VATFTVNGRG FTNTGLARRA TYHYRIRAFN VVGTSPFTNP VAVTTPD